Fig. 1: Vaccine intent classifier. | Nature Communications

From: Measuring vaccination coverage and concerns of vaccine holdouts from web search logs

a Our computational approach centers on query-click graphs constructed from billions of Bing search logs. b Using these graphs, we introduce a three-step pipeline to identify vaccine intent URLs: (1) generate URL candidates via Personalized PageRank; (2) present the URL candidates to annotators; and (3) expand the final set of URLs with graph neural networks. Each step improves our coverage of users and our correlation with CDC vaccination rates (Table 1). c Our vaccine intent estimates are highly correlated with state vaccination rates from the CDC. Here, we compare cumulative rates up to August 31, 2021 (r = 0.86). d Our estimates are also highly correlated with CDC rates over time (r = 0.89, median over states), with the CDC time series lagging ours by between 7 and 15 days (IQR). Here, we visualize time series for the four largest US states, with extended results in “Comparison to reported vaccination rates”.
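
To make the first pipeline step in panel b concrete, here is a minimal sketch of Personalized PageRank on a toy query-click graph. It is not the authors' implementation: the caption does not state the seed set, damping factor, or edge weighting, so the seed query, `alpha = 0.85`, the click counts, and every query string and `.example` URL below are hypothetical assumptions chosen for illustration.

```python
from collections import defaultdict


def personalized_pagerank(edges, seeds, alpha=0.85, iters=100):
    """Power iteration for Personalized PageRank on an undirected weighted graph.

    edges: dict mapping (query, url) -> click count (hypothetical toy data)
    seeds: set of nodes that receive all of the teleport probability
    alpha: probability of following an edge; 1 - alpha teleports to a seed
    """
    # Build a symmetric weighted adjacency list: a click links a query and a URL.
    adj = defaultdict(dict)
    for (q, u), w in edges.items():
        adj[q][u] = adj[q].get(u, 0) + w
        adj[u][q] = adj[u].get(q, 0) + w

    nodes = list(adj)
    # Teleport distribution: uniform over the seeds, zero elsewhere.
    teleport = {n: (1.0 / len(seeds) if n in seeds else 0.0) for n in nodes}
    score = dict(teleport)

    for _ in range(iters):
        # Start each node with its teleport share, then add walk contributions.
        nxt = {n: (1 - alpha) * teleport[n] for n in nodes}
        for n in nodes:
            total = sum(adj[n].values())
            for m, w in adj[n].items():
                nxt[m] += alpha * score[n] * w / total
        score = nxt
    return score


# Hypothetical toy graph: (query, clicked URL) -> number of clicks.
edges = {
    ("covid vaccine near me", "vaccine-finder.example/search"): 50,
    ("covid vaccine near me", "pharmacy.example/covid-vaccine"): 30,
    ("book vaccine appointment", "pharmacy.example/covid-vaccine"): 40,
    ("book vaccine appointment", "pharmacy.example/schedule"): 20,
    ("weather today", "weather.example/forecast"): 60,
}
seeds = {"covid vaccine near me"}  # assumed seed; must be a node in the graph

scores = personalized_pagerank(edges, seeds)

# Rank URL nodes by score; the top of this list would go to annotators.
url_candidates = sorted(
    (n for n in scores if ".example/" in n), key=scores.get, reverse=True
)
```

In this toy graph the weather URL is disconnected from the seed query and therefore keeps a score of exactly zero, while URLs reachable from the seed through shared queries receive positive scores. That is the property that makes the method useful for proposing candidates beyond the URLs directly clicked from seed queries.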
