User:Stu/comScore data on Wikimedia

From Meta, a Wikimedia project coordination wiki

Jump to: navigation, search

Contents

One of the major online audience measurement companies, comScore, Inc., has generously donated access to its Media Metrix and World Metrix data sets to the Foundation. comScore has an opt-in panel of two million internet users around the globe and uses a range of statistical techniques to create an internally consistent portrait of the global internet audience. As an example, below is a chart summarizing comScore's estimated audience over the past few years for wikipedia.org, both in the U.S. and worldwide.

Wikipedia.org audience trend.jpg

[edit] March 2009 data

comScore estimates that, during the month of March 2009, 327 million unique visitors (UVs) viewed our projects from a personal computer, which it estimates was a "reach" of 29.9% of the 1.09 billion worldwide PC-based web browser audience:

Worldwide unique visitors
Google Sites (includes YouTube) 831 million
Microsoft Sites 692 million
Yahoo! Sites 594 million
Wikimedia Foundation Sites 327 million
FACEBOOK.COM 295 million
AOL LLC 286 million
eBay 250 million
CBS Corporation (includes CNET) 203 million
Amazon Sites 195 million
Fox Interactive Media (includes Myspace) 183 million

Facebook moved into the #5 spot this month, passing AOL.

[edit] Geographic breakdown

comScore estimates our audience in different regions, and also estimates what percentages of the audience within each region visited one of our sites:

Unique visitors Reach in region
Worldwide 327.1 million 29.9%
Europe 126.5 million 40.9%
Asia Pacific 78.0 million 17.8%
--India 7.0 million 21.0%
--China 2.8 million 1.4%
North America 66.9 million 36.0%
--United States 54.8 million 33.1%
Latin America 34.2 million 41.5%
Middle East - Africa 21.6 million 28.2%

[edit] Language breakdown

comScore estimates visitors to the different language versions of Wikipedia and estimates the unique visitors worldwide:

Worldwide unique visitors
English Wikipedia 166.0 million
Spanish Wikipedia 30.6 million
Japanese Wikipedia 28.0 million
French Wikipedia 23.7 million
German Wikipedia 23.2 million
Portuguese Wikipedia 12.2 million
Italian Wikipedia 10.6 million
Russian Wikipedia 9.5 million
Arabic Wikipedia 8.4 million
Vietnamese Wikipedia 4.9 million
Chinese language wikipedias 4.7 million
Korean Wikipedia 2.5 million
Indian language wikipedias .3 million

This month Spanish surpassed Japanese to become the second most visited Wikipedia after English. Other notables changes included an increase in the Arabic Wikipedia from 6.8 million in February to 8.4 million in March.

[edit] Demographic breakdown

comScore's panelists report age and sex so it can generate detailed demographic estimates, including raw data and also an index which measures the extent to which a set of visitors to our sites is over or under-represented compared to visitors to all sites on the internet. For March, comScore estimates our 327 million audience is made up of 181 million men (30.7% of men online) and 146 million women (29.1% of women online). We index slightly higher with men (102) than with women (97). Here's a breakdown of different age groups:

Worldwide unique visitors Reach in
age group
Index
Ages 15-24 87 million 29.7% 99
Ages 25-34 75 million 26.2% 88
Ages 35-44 68 million 29.0% 97
Ages 45-54 54 million 34.9% 117
Ages 55+ 42 million 34.7% 116

We index highest for older users (ages 45-54 and 55+) and lowest for those 25-34 years old. I dug into this issue, at first thinking it was driven by twenty-something preference for YouTube and Facebook. As far as I can tell, though, our comparatively weak performance in the 25-34 year old demographic is the result of our weakness in China where comScore believes there is a huge audience in that age range. For example, Tencent, Baidu and SINA all index above 120 for this demo while Facebook, Google overall, and Yahoo are in the 90s while both MySpace and YouTube are with us down in the 80s.

[edit] Project breakdown

comScore estimates our audience by project:

Worldwide unique visitors
Wikipedia 324.7 million
Wiktionary 8.6 million
Wikimedia Commons 5.5 million
Wikibooks 3.8 million
Wikisource 2.9 million
Wikiquote 2.6 million
Wikinews .6 million
Wikiversity .5 million
Wikispecies .2 million

[edit] China, India trends

comScore estimates the unique visitors to our sites from home and office users in China (excluding Taiwan and Hong Kong). In July of 2008, comScore estimated 232,000 UVs to our sites in China. In August, the month of the 2008 Beijing Olympics, comScore estimates we had 1.3 million visitors. By March, the audience estimate was 2.75 million, comprised of 1.8 million UVs to one of the Chinese language wikipedias and 0.8 million to the English Wikipedia. By contrast, comScore estimates the Baidu Encyclopedia had 39 million visitors from within China in March. Given that comScore does not track internet usage from public locations (e.g. internet cafes), these estimates certainly undercount overall activity from China.

In India, comScore estimates 7.0 million unique visitors came to our sites, or 21% of internet users in India. Of these, 6.9 million visited the English Wikipedia while just over 100,000 visited one of the different Indian language wikipedias.

[edit] Source of traffic

comScore also provides analysis of the site a user surfs just prior to visiting us. The percentage of these "entries" from Google and other search engines is often used as an indicator of reliance on the search engines for traffic. Other major sites like YouTube, eBay or Facebook typically see entries from Google at 10% to 15% of their traffic while we are typically over 50%. Here's a breakdown of the top 4 for us:

Entries % of total entries
Google Sites (includes YouTube) 1,491 million 57.6%
Yahoo! Sites 147 million 5.7%
Microsoft Sites 106 million 4.1%
Logon 28 million 1.1%

[edit] Portal usage

We worked with comScore to include estimates on usage of the Wikipedia portal at www.wikipedia.org. There's a wide range of usage across geographies:

Unique visitors to WP Unique visitors to portal % of UVs
using portal
Worldwide 324.7 million 15.2 million 4.7%
Europe 125.4 million 4.0 million 3.2%
Asia Pacific 77.6 million 4.1 million 5.2%
North America 66.4 million 5.0 million 7.4%
Latin America 34.0 million 1.2 million 3.5%
Middle East - Africa 21.3 million .9 million 4.4%

[edit] Trend data

I've put together a PDF of comScore's estimates of monthly unique visitors to Wikimedia Foundation Sites since Sep 2007. Contact me at stu[at]wikimedia.org if you'd like a worksheet with the underlying data.

[edit] Participation estimates

I wanted a sense of what percentage of our audience actively participates. comScore gives good data on unique visitors, and Erik Zachte and others compile counts of registered users who have made at least five edits in a month, which seems a reasonable threshold for active participation as it would eliminate some casual or accidental editors. With data coming from two different data sources it's a bit apples-and-oranges, but is still useful.

The table below shows the calculations for the biggest few Wikipedias. Due to the size of the English Wikipedia, editor compilations happen infrequently so the most recent data covers September of 2008. On the English Wikipedia only about .03% of the unique visitors actively edit. Put another way, that's less than one-third of one-tenth of one percent. If you include all users who made at least one edit, it's about triple that amount or just under .1%.

Sep '08 UVs from comScore Sep '08 editors with 5+ edits % of UVs
with 5+ edits
English Wikipedia 140,710,255 41,393 0.029%
Japanese Wikipedia 25,698,145 4,390 0.017%
Spanish Wikipedia 25,388,063 4,016 0.016%
German Wikipedia 20,435,314 7,144 0.035%
French Wikipedia 16,428,023 4,602 0.028%
Portugese Wikipedia 10,787,686 1,710 0.016%
Italian Wikipedia 8,637,544 3,208 0.037%
Russian Wikipedia 6,534,903 2,672 0.041%
Source: UV stats from comScore, editor stats for English Wikipedia from http://en.wikipedia.org/wiki/Wikipedia:Editing_frequency, stats for other wikipedias from http://stats.wikimedia.org/EN/TablesWikipediansEditsGt5.htm

[edit] Analysis from earlier months

[edit] Discussion of comScore & Wikimedia

Jay Walsh on the Foundation's staff is managing the comScore relationship overall, Erik Zachte is helping drive the statistical analysis, and a volunteer named Josh Holman has a lot of experience with comScore data. Feel free to reach out to me or any of them with questions. If there's interest, we'll try to update this page every month or two as new data comes out.

comScore is just one of the different internet measurement options (others include Alexa, Compete, Google Insights for Search, Google Trends, Hitwise, Nielsen, Quantcast, server log analysis).

Finally a quick thank you to comScore. The data they donated typically sells for thousands and thousands of dollars, so we're lucky to be able to review. Speaking on behalf of all of us in the community, I want to thank them for their support.

[edit] Benefits

  • comScore has a large and professional team dedicated to audience measurement. We are able to benefit from their insights with no coding, no servers, no hard drives stuffed full of log data, and almost no effort.
  • comScore reports "unique visitors", which estimates the actual people using the internet. This puts things into more human terms than page views or an ambiguous "traffic rank" metric. Also, comScore works hard to exclude bots, crawlers, mirrors, click farms, etc.
  • comScore works to combine different domains and subdomains. This is particularly useful for international properties. For example, we are able to generate a single audience number for all five or so Chinese language Wikipedias and compare that, both worldwide and within China, to the audience using the English Wikipedia.
  • Because comScore does its analysis consistently for all websites, with the same statistical techniques and methodology, we can compare among our projects and to others.
  • With a panel of two million users globally, comScore has strong international coverage.
  • comScore panelists provide demographic data so we can see estimates of factors like age and sex.

[edit] Limitations

  • Coverage of educational users -- comScore focuses on users 15 years old and older using the internet at home or work. Globally, it does not have coverage in schools (though it does have coverage in universities in a few countries). Given our strengths in education this will inevitably lead to significant underreporting of school use and thus our overall audience.
  • Coverage of worldwide usage -- comScore recruits a panel of users across the world, but their coverage can't be perfect. Given our strong international presence, this will likely also lead to some misreporting of our audience. Also, the dynamics of their panel make analysis less and less valuable the deeper you drill down. Statistics for a specific smaller countries (e.g. Egypt) are typically not available or if they are might be less useful depending on the size and make-up of comScore's panel there.
  • Coverage outside home/work -- comScore does not measure people who go online from an internet café or other public/shared computers. This means their audience estimates in certain parts of the world will be significantly underreported. This will have a major impact of underreporting total audience in countries with strong public/shared internet usage. Whether this has a big impact in percentage reach numbers depends on differences in home/work usage and public/shared usage (which might be meaningful in some countries where governments are believed to trace people's internet usage).
  • Coverage of the mobile audience -- This data set of comScore's is of the PC-based internet audience, so excludes access through mobile phones. A recent Nielsen research report estimates there are about 40 million mobile web users in the U.S. alone, and most industry observers expect this number to grow rapidly. This is likely another source of underreporting of the total audience we reach. (FWIW, my personal opinion is that while the vast majority of the first billion internet users access the web through a PC, the vast majority of the second billion internet users will use the web through a mobile phone).

[edit] What's included?

comScore offers a sophisticated ability to combine domains and subdomains to better understand the audience for and performance of our projects. We've worked extensively with them over the past few months to clean-up their definition. We want to be inclusive and careful in defining the different Media titles ([M]), Channels ([C]), and Subchannels ([S]) so we can see what's happening with the different projects. Also, we don't need to be exhaustive and capture every single domain name. A domain which automatically redirects to one of our other sites would end up being counted after the redirect. If you see other changes we should request, start a thread on the Talk page.

Below is comScore's definition as of March 2009, which includes some still experimental efforts to identify edit pages:

[P] Wikimedia Foundation Sites

[M] WIKIBOOKS.ORG
%.WIKIBOOKS.COM%
%.WIKIBOOKS.ORG%
[C] Wikibooks Edit Pages
%.WIKIBOOKS.ORG%&ACTION=EDIT%
[M] Wikimedia Commons
%.COMMONS.WIKIMEDIA.ORG%
COMMONS.WIKIMEDIA.ORG%
[C] Wikimedia Commons Edit Pages
%.COMMONS.WIKIMEDIA.ORG%&ACTION=EDIT%
COMMONS.WIKIMEDIA.ORG%&ACTION=EDIT%
[M] Wikimedia Community Sites
[C] MEDIAWIKI.ORG
%.MEDIAWIKI.ORG%
[C] WIKIMEDIA.AT
%.WIKIMEDIA.AT%
[C] WIKIMEDIA.CH
%.WIKIMEDIA.CH%
[C] WIKIMEDIA.DE
%.WIKIMEDIA.DE%
[C] WIKIMEDIA.FR
%.WIKIMEDIA.FR%
[C] WIKIMEDIA.IN
%.WIKIMEDIA.IN%
[C] WIKIMEDIA.IT
%.WIKIMEDIA.IT%
[C] WIKIMEDIA.NO
%.WIKIMEDIA.NO%
[C] WIKIMEDIA.ORG%
%.WIKIMEDIA.COM%
%.WIKIMEDIA.ORG%
[S] Wikimedia Meta-Wiki Edit Pages
%.META.WIKIMEDIA.ORG%&ACTION=EDIT%
META.WIKIMEDIA.ORG%&ACTION=EDIT%
[C] WIKIMEDIA.ORG.AR
%.WIKIMEDIA.ORG.AR%
[C] WIKIMEDIA.ORG.AU
%.WIKIMEDIA.ORG.AU%
[C] WIKIMEDIA.ORG.CO
%.WIKIMEDIA.ORG.CO%
[C] WIKIMEDIA.ORG.IN
%.WIKIMEDIA.ORG.IN%
[C] WIKIMEDIA.ORG.UK
%.WIKIMEDIA.ORG.UK%
[C] WIKIMEDIA.ORG.VE
%.WIKIMEDIA.ORG.VE%
[C] WIKIMEDIA.RO
%.WIKIMEDIA.RO%
[C] WIKIMEDIA.TW
%.WIKIMEDIA.TW%
[C] WIKIMEDIA.WEB.ID
%.WIKIMEDIA.WEB.ID%
[C] WIKIMEDIAFOUNDATION.ORG
%.WIKIMEDIAFOUNDATION.COM%
%.WIKIMEDIAFOUNDATION.ORG%
[C] WIKIMEDIAHK.ORG
%.WIKIMEDIAHK.ORG%
[C] WMNL.NL
%.WMNL.NL%
[M] WIKINEWS.ORG
%.WIKINEWS.ORG%
[C] Wikinews Edit Pages
%.WIKINEWS.ORG%&ACTION=EDIT%
[M] Wikipedia International Portals
[C] WIKIPEDIA.AT
%.WIKIPEDIA.AT%
[C] WIKIPEDIA.BE
%.WIKIPEDIA.BE%
[C] WIKIPEDIA.CH
%.WIKIPEDIA.CH%
[C] WIKIPEDIA.DE
%.WIKIPEDIA.DE%
[C] WIKIPEDIA.DK
%.WIKIPEDIA.DK%
[C] WIKIPEDIA.FR
%.WIKIPEDIA.FR%
[C] WIKIPEDIA.IT
%.WIKIPEDIA.IT%
[M] WIKIPEDIA.ORG
%.WIKIPEDIA.COM%
%.WIKIPEDIA.CZ%
%.WIKIPEDIA.HU%
%.WIKIPEDIA.NL%
%.WIKIPEDIA.NO%
%.WIKIPEDIA.ORG%
%.WIKIPEDIA.PL%
%.WIKIPEDIA.SE%
[C] Arabic Wikipedia
AR.WIKIPEDIA.ORG%
[C] Chinese Wikipedias
%.WIKIPEDIA.TW%
%.WUU.WIKIPEDIA.ORG%
%.ZH-CLASSICAL.WIKIPEDIA.ORG%
%.ZH-MIN-NAN.WIKIPEDIA.ORG%
%.ZH-YUE.WIKIPEDIA.ORG%
WUU.WIKIPEDIA.ORG%
ZH-CLASSICAL.WIKIPEDIA.ORG%
ZH-MIN-NAN.WIKIPEDIA.ORG%
ZH-YUE.WIKIPEDIA.ORG%
ZH.WIKIPEDIA.ORG%
[C] English Wikipedia
EN.WIKIPEDIA.ORG%
[C] French Wikipedia
FR.WIKIPEDIA.ORG%
[C] German Wikipedia
DE.WIKIPEDIA.ORG%
[C] Indian Wikipedias
%.AS.WIKIPEDIA.ORG%
%.BH.WIKIPEDIA.ORG%
%.BN.WIKIPEDIA.ORG%
%.BPY.WIKIPEDIA.ORG%
%.GU.WIKIPEDIA.ORG%
%.HI.WIKIPEDIA.ORG%
%.KN.WIKIPEDIA.ORG%
%.KS.WIKIPEDIA.ORG%
%.ML.WIKIPEDIA.ORG%
%.MR.WIKIPEDIA.ORG%
%.NE.WIKIPEDIA.ORG%
%.NEW.WIKIPEDIA.ORG%
%.OR.WIKIPEDIA.ORG%
%.PJ.WIKIPEDIA.ORG%
%.SA.WIKIPEDIA.ORG%
%.SD.WIKIPEDIA.ORG%
%.TA.WIKIPEDIA.ORG%
%.TE.WIKIPEDIA.ORG%
%.UR.WIKIPEDIA.ORG%
AS.WIKIPEDIA.ORG%
BH.WIKIPEDIA.ORG%
BN.WIKIPEDIA.ORG%
BPY.WIKIPEDIA.ORG%
GU.WIKIPEDIA.ORG%
HI.WIKIPEDIA.ORG%
KN.WIKIPEDIA.ORG%
KS.WIKIPEDIA.ORG%
ML.WIKIPEDIA.ORG%
MR.WIKIPEDIA.ORG%
NE.WIKIPEDIA.ORG%
NEW.WIKIPEDIA.ORG%
OR.WIKIPEDIA.ORG%
PJ.WIKIPEDIA.ORG%
SA.WIKIPEDIA.ORG%
SD.WIKIPEDIA.ORG%
TA.WIKIPEDIA.ORG%
TE.WIKIPEDIA.ORG%
UR.WIKIPEDIA.ORG%
[C] Italian Wikipedia
%.IT.WIKIPEDIA.ORG%
IT.WIKIPEDIA.ORG%
[C] Japanese Wikipedia
%.JA.WIKIPEDIA.ORG%
%.WIKIPEDIA.JP%
JA.WIKIPEDIA.ORG%
[C] Javanese Wikipedia
%.JV.WIKIPEDIA.ORG%
JV.WIKIPEDIA.ORG%
[C] Korean Wikipedia
%.KO.WIKIPEDIA.ORG%
KO.WIKIPEDIA.ORG%
[C] Portugese Wikipedia
%.PT.WIKIPEDIA.ORG%
%.WIKIPEDIA.ORG.BR%
PT.WIKIPEDIA.ORG%
[C] Russian Wikipedia
%.WIKIPEDIA.SU%
RU.WIKIPEDIA.ORG%
[C] Spanish Wikipedia
ES.WIKIPEDIA.ORG%
[C] Vietnamese Wikipedia
%.VI.WIKIPEDIA.ORG%
VI.WIKIPEDIA.ORG%
[C] Wikipedia Edit Pages
%.WIKIPEDIA.ORG%&ACTION=EDIT%
[S] English Wikipedia Edit Pages
EN.WIKIPEDIA.ORG%&ACTION=EDIT%
[C] Wikipedia.org Homepage
WWW.WIKIPEDIA.ORG/
[M] WIKIQUOTE.ORG
%.WIKIQUOTE.COM%
%.WIKIQUOTE.ORG%
[C] Wikiquote Edit Pages
%.WIKIQUOTE.ORG%&ACTION=EDIT%
[M] WIKISOURCE.ORG
%.WIKISOURCE.COM%
%.WIKISOURCE.ORG%
[C] Wikisource Edit Pages
%.WIKISOURCE.ORG%&ACTION=EDIT%
[M] Wikispecies
%.SPECIES.WIKIMEDIA.ORG%
SPECIES.WIKIMEDIA.ORG%
[C] Wikispecies Edit Pages
SPECIES.WIKIMEDIA.ORG%&ACTION=EDIT%
[M] WIKIVERSITY.ORG
%.WIKIVERSITY.COM%
%.WIKIVERSITY.ORG%
[C] Wikiversity Edit Pages
%.WIKIVERSITY.ORG%&ACTION=EDIT%
[M] WIKTIONARY.ORG
%.WIKTIONARY.COM%
%.WIKTIONARY.ORG%
[C] Wiktionary Edit Pages
%.WIKTIONARY.ORG%&ACTION=EDIT%