User:Stu/comScore data on Wikimedia
One of the major online audience measurement companies, comScore, Inc., has generously donated access to its Media Metrix and World Metrix data sets to the Foundation. comScore has an opt-in panel of two million internet users around the globe and uses a range of statistical techniques to create an internally consistent portrait of the global internet audience. As an example, below is a chart summarizing comScore's estimated audience over the past few years for wikipedia.org, both in the U.S. and worldwide.
March 2009 data
comScore estimates that, during the month of March 2009, 327 million unique visitors (UVs) viewed our projects from a personal computer, which it estimates was a "reach" of 29.9% of the 1.09 billion worldwide PC-based web browser audience:
| Property | Worldwide unique visitors |
| Google Sites (includes YouTube) | 831 million |
| Microsoft Sites | 692 million |
| Yahoo! Sites | 594 million |
| Wikimedia Foundation Sites | 327 million |
| FACEBOOK.COM | 295 million |
| AOL LLC | 286 million |
| eBay | 250 million |
| CBS Corporation (includes CNET) | 203 million |
| Amazon Sites | 195 million |
| Fox Interactive Media (includes Myspace) | 183 million |
Facebook moved into the #5 spot this month, passing AOL.
Geographic breakdown
comScore estimates our audience in different regions, and also estimates what percentage of the audience within each region visited one of our sites:
| Region | Unique visitors | Reach in region |
| Worldwide | 327.1 million | 29.9% |
| Europe | 126.5 million | 40.9% |
| Asia Pacific | 78.0 million | 17.8% |
| --India | 7.0 million | 21.0% |
| --China | 2.8 million | 1.4% |
| North America | 66.9 million | 36.0% |
| --United States | 54.8 million | 33.1% |
| Latin America | 34.2 million | 41.5% |
| Middle East - Africa | 21.6 million | 28.2% |
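As a rough cross-check, reach and unique visitors together imply the size of the online audience comScore assumes for each region. The Python sketch below is only an illustration: it assumes reach is simply unique visitors divided by the region's total online audience, and it works from the rounded figures in the table, so the results are approximate. The function name is made up for this example.

```python
# Unique visitors (millions) and reach (%) copied from the table above.
REGIONS = {
    "Worldwide": (327.1, 29.9),
    "Europe": (126.5, 40.9),
    "Asia Pacific": (78.0, 17.8),
    "North America": (66.9, 36.0),
    "Latin America": (34.2, 41.5),
    "Middle East - Africa": (21.6, 28.2),
}


def implied_audience(unique_visitors_m, reach_pct):
    """Back out the total online audience (millions) implied by UVs and reach.

    Assumption: reach = unique visitors / total online audience.
    """
    return unique_visitors_m / (reach_pct / 100.0)


for region, (uvs, reach) in REGIONS.items():
    print(f"{region}: roughly {implied_audience(uvs, reach):.0f} million people online")
```

The worldwide row comes out at roughly 1.09 billion, which agrees with the PC-based web audience quoted at the top of this page.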
Language breakdown
comScore estimates worldwide unique visitors to each language version of Wikipedia:
| Wikipedia | Worldwide unique visitors |
| English Wikipedia | 166.0 million |
| Spanish Wikipedia | 30.6 million |
| Japanese Wikipedia | 28.0 million |
| French Wikipedia | 23.7 million |
| German Wikipedia | 23.2 million |
| Portuguese Wikipedia | 12.2 million |
| Italian Wikipedia | 10.6 million |
| Russian Wikipedia | 9.5 million |
| Arabic Wikipedia | 8.4 million |
| Vietnamese Wikipedia | 4.9 million |
| Chinese language wikipedias | 4.7 million |
| Korean Wikipedia | 2.5 million |
| Indian language wikipedias | 0.3 million |
This month Spanish surpassed Japanese to become the second most visited Wikipedia after English. Other notable changes included an increase in the Arabic Wikipedia from 6.8 million in February to 8.4 million in March.
Demographic breakdown
comScore's panelists report their age and sex, so it can generate detailed demographic estimates. These include raw data and an index that measures the extent to which a set of visitors to our sites is over- or under-represented compared to visitors to all sites on the internet. For March, comScore estimates our audience of 327 million is made up of 181 million men (30.7% of men online) and 146 million women (29.1% of women online). We index slightly higher with men (102) than with women (97). Here's a breakdown of different age groups:
| Age group | Worldwide unique visitors | Reach in age group | Index |
| Ages 15-24 | 87 million | 29.7% | 99 |
| Ages 25-34 | 75 million | 26.2% | 88 |
| Ages 35-44 | 68 million | 29.0% | 97 |
| Ages 45-54 | 54 million | 34.9% | 117 |
| Ages 55+ | 42 million | 34.7% | 116 |
We index highest for older users (ages 45-54 and 55+) and lowest for those 25-34 years old. I dug into this issue, at first thinking it was driven by twenty-somethings' preference for YouTube and Facebook. As far as I can tell, though, our comparatively weak performance in the 25-34 year old demographic is the result of our weakness in China, where comScore believes there is a huge audience in that age range. For example, Tencent, Baidu and SINA all index above 120 for this demographic, while Facebook, Google overall, and Yahoo are in the 90s, and both MySpace and YouTube are with us down in the 80s.
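For anyone curious how the index relates to reach, here is a minimal Python sketch. It assumes the index is just 100 times the reach within a group divided by our overall worldwide reach (29.9%); that is my reading of the numbers, not a formula confirmed by comScore. Under that assumption the age-group index values in the table are reproduced from the rounded reach figures.

```python
OVERALL_REACH = 29.9  # worldwide reach (%) for March 2009

# Reach within each age group (%) and the index published in the table above.
AGE_GROUPS = {
    "15-24": (29.7, 99),
    "25-34": (26.2, 88),
    "35-44": (29.0, 97),
    "45-54": (34.9, 117),
    "55+": (34.7, 116),
}


def audience_index(group_reach_pct, overall_reach_pct=OVERALL_REACH):
    """Assumed formula: 100 * (reach in group) / (overall reach), rounded."""
    return round(100 * group_reach_pct / overall_reach_pct)


for group, (reach, published) in AGE_GROUPS.items():
    print(f"Ages {group}: computed {audience_index(reach)}, published {published}")
```

The same formula gives 103 rather than the published 102 for men (30.7% reach), presumably because comScore works from unrounded inputs.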
Project breakdown
comScore estimates our audience by project:
| Project | Worldwide unique visitors |
| Wikipedia | 324.7 million |
| Wiktionary | 8.6 million |
| Wikimedia Commons | 5.5 million |
| Wikibooks | 3.8 million |
| Wikisource | 2.9 million |
| Wikiquote | 2.6 million |
| Wikinews | 0.6 million |
| Wikiversity | 0.5 million |
| Wikispecies | 0.2 million |
China, India trends
comScore estimates the unique visitors to our sites from home and office users in China (excluding Taiwan and Hong Kong). In July of 2008, comScore estimated 232,000 UVs to our sites in China. In August, the month of the 2008 Beijing Olympics, comScore estimated we had 1.3 million visitors. By March 2009, the audience estimate was 2.75 million, including 1.8 million UVs to one of the Chinese language wikipedias and 0.8 million to the English Wikipedia. By contrast, comScore estimates the Baidu Encyclopedia had 39 million visitors from within China in March. Given that comScore does not track internet usage from public locations (e.g. internet cafes), these estimates certainly undercount overall activity from China.
In India, comScore estimates 7.0 million unique visitors came to our sites, or 21% of internet users in India. Of these, 6.9 million visited the English Wikipedia while just over 100,000 visited one of the different Indian language wikipedias.
Source of traffic
comScore also provides analysis of the site a user surfs just prior to visiting us. The percentage of these "entries" from Google and other search engines is often used as an indicator of reliance on the search engines for traffic. Other major sites like YouTube, eBay or Facebook typically see entries from Google at 10% to 15% of their traffic while we are typically over 50%. Here's a breakdown of the top 4 for us:
| Source | Entries | % of total entries |
| Google Sites (includes YouTube) | 1,491 million | 57.6% |
| Yahoo! Sites | 147 million | 5.7% |
| Microsoft Sites | 106 million | 4.1% |
| Logon | 28 million | 1.1% |
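The table only lists the top four sources, but the entry counts and percentages together imply a rough total. The Python sketch below is a back-of-the-envelope illustration working from the rounded figures above; the names are made up for this example and the results are approximate.

```python
# Entries (millions) and share of total entries (%) from the table above.
ENTRIES = {
    "Google Sites (includes YouTube)": (1491, 57.6),
    "Yahoo! Sites": (147, 5.7),
    "Microsoft Sites": (106, 4.1),
    "Logon": (28, 1.1),
}


def implied_total_entries(entries_m, share_pct):
    """Total entries (millions) implied by one source's count and share."""
    return entries_m / (share_pct / 100.0)


google_entries, google_share = ENTRIES["Google Sites (includes YouTube)"]
total = implied_total_entries(google_entries, google_share)

# Share of entries accounted for by the four listed sources, and the remainder.
top4_share = sum(share for _, share in ENTRIES.values())

print(f"Implied total entries: about {total:.0f} million")
print(f"Top 4 sources: {top4_share:.1f}% of entries, all others: {100 - top4_share:.1f}%")
```

In other words, the Google row implies about 2.6 billion entries for the month, with a bit under a third coming from sources outside the top four.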
Portal usage
We worked with comScore to include estimates on usage of the Wikipedia portal at www.wikipedia.org. There's a wide range of usage across geographies:
| Region | Unique visitors to WP | Unique visitors to portal | % of UVs using portal |
| Worldwide | 324.7 million | 15.2 million | 4.7% |
| Europe | 125.4 million | 4.0 million | 3.2% |
| Asia Pacific | 77.6 million | 4.1 million | 5.2% |
| North America | 66.4 million | 5.0 million | 7.4% |
| Latin America | 34.0 million | 1.2 million | 3.5% |
| Middle East - Africa | 21.3 million | 0.9 million | 4.4% |
Trend data
I've put together a PDF of comScore's estimates of monthly unique visitors to Wikimedia Foundation Sites since Sep 2007. Contact me at stu at wikimedia.org if you'd like a worksheet with the underlying data.
Participation estimates
I wanted a sense of what percentage of our audience actively participates. comScore gives good data on unique visitors, and Erik Zachte and others compile counts of registered users who have made at least five edits in a month, which seems a reasonable threshold for active participation as it would eliminate some casual or accidental editors. With data coming from two different data sources it's a bit apples-and-oranges, but is still useful.
The table below shows the calculations for the biggest few Wikipedias. Due to the size of the English Wikipedia, editor compilations happen infrequently, so the most recent data covers September of 2008. On the English Wikipedia only about 0.03% of the unique visitors actively edit. Put another way, that's less than one-third of one-tenth of one percent. If you include all users who made at least one edit, it's about triple that amount, or just under 0.1%.
| Wikipedia | Sep '08 UVs from comScore | Sep '08 editors with 5+ edits | % of UVs with 5+ edits |
| English Wikipedia | 140,710,255 | 41,393 | 0.029% |
| Japanese Wikipedia | 25,698,145 | 4,390 | 0.017% |
| Spanish Wikipedia | 25,388,063 | 4,016 | 0.016% |
| German Wikipedia | 20,435,314 | 7,144 | 0.035% |
| French Wikipedia | 16,428,023 | 4,602 | 0.028% |
| Portuguese Wikipedia | 10,787,686 | 1,710 | 0.016% |
| Italian Wikipedia | 8,637,544 | 3,208 | 0.037% |
| Russian Wikipedia | 6,534,903 | 2,672 | 0.041% |
Source: UV stats from comScore; editor stats for English Wikipedia from http://en.wikipedia.org/wiki/Wikipedia:Editing_frequency; stats for other wikipedias from http://stats.wikimedia.org/EN/TablesWikipediansEditsGt5.htm
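The last column is just active editors divided by unique visitors. Here is a small Python sketch of that calculation using the figures from the table; the function name is made up for this example, and the apples-and-oranges caveat about mixing two data sources still applies.

```python
# Sep '08 unique visitors (comScore) and editors with 5+ edits, from the table above.
SEP_2008 = {
    "English": (140_710_255, 41_393),
    "Japanese": (25_698_145, 4_390),
    "Spanish": (25_388_063, 4_016),
    "German": (20_435_314, 7_144),
    "French": (16_428_023, 4_602),
    "Portuguese": (10_787_686, 1_710),
    "Italian": (8_637_544, 3_208),
    "Russian": (6_534_903, 2_672),
}


def active_editor_share(unique_visitors, active_editors):
    """Percentage of unique visitors who made 5+ edits in the month."""
    return 100.0 * active_editors / unique_visitors


for language, (uvs, editors) in SEP_2008.items():
    share = active_editor_share(uvs, editors)
    print(f"{language} Wikipedia: {share:.3f}% (about 1 in {uvs // editors:,} visitors)")
```

Expressed the other way around, the English Wikipedia figure works out to roughly one active editor for every 3,400 unique visitors.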
Analysis from earlier months
Discussion of comScore & Wikimedia
Jay Walsh on the Foundation's staff is managing the comScore relationship overall, Erik Zachte is helping drive the statistical analysis, and a volunteer named Josh Holman has a lot of experience with comScore data. Feel free to reach out to me or any of them with questions. If there's interest, we'll try to update this page every month or two as new data comes out.
comScore is just one of the different internet measurement options (others include Alexa, Compete, Google Insights for Search, Google Trends, Hitwise, Nielsen, Quantcast, server log analysis).
Finally, a quick thank you to comScore. The data they donated typically sells for thousands and thousands of dollars, so we're lucky to be able to review it. Speaking on behalf of all of us in the community, I want to thank them for their support.
Benefits
- comScore has a large and professional team dedicated to audience measurement. We are able to benefit from their insights with no coding, no servers, no hard drives stuffed full of log data, and almost no effort.
- comScore reports "unique visitors", which estimates the actual people using the internet. This puts things into more human terms than page views or an ambiguous "traffic rank" metric. Also, comScore works hard to exclude bots, crawlers, mirrors, click farms, etc.
- comScore works to combine different domains and subdomains. This is particularly useful for international properties. For example, we are able to generate a single audience number for all five or so Chinese language Wikipedias and compare that, both worldwide and within China, to the audience using the English Wikipedia.
- Because comScore does its analysis consistently for all websites, with the same statistical techniques and methodology, we can compare among our projects and to others.
- With a panel of two million users globally, comScore has strong international coverage.
- comScore panelists provide demographic data so we can see estimates of factors like age and sex.
Limitations
- Coverage of educational users -- comScore focuses on users 15 years old and older using the internet at home or work. Globally, it does not have coverage in schools (though it does have coverage in universities in a few countries). Given our strengths in education this will inevitably lead to significant underreporting of school use and thus our overall audience.
- Coverage of worldwide usage -- comScore recruits a panel of users across the world, but their coverage can't be perfect. Given our strong international presence, this will likely also lead to some misreporting of our audience. Also, the dynamics of their panel make analysis less and less valuable the deeper you drill down. Statistics for specific smaller countries (e.g. Egypt) are typically not available or, if they are, might be less useful depending on the size and make-up of comScore's panel there.
- Coverage outside home/work -- comScore does not measure people who go online from an internet café or other public/shared computers. This means their total audience estimates will be significantly underreported in countries with strong public/shared internet usage. Whether this has a big impact on percentage reach numbers depends on differences between home/work usage and public/shared usage (which might be meaningful in some countries where governments are believed to trace people's internet usage).
- Coverage of the mobile audience -- This data set of comScore's is of the PC-based internet audience, so excludes access through mobile phones. A recent Nielsen research report estimates there are about 40 million mobile web users in the U.S. alone, and most industry observers expect this number to grow rapidly. This is likely another source of underreporting of the total audience we reach. (FWIW, my personal opinion is that while the vast majority of the first billion internet users access the web through a PC, the vast majority of the second billion internet users will use the web through a mobile phone).
What's included?
comScore offers a sophisticated ability to combine domains and subdomains to better understand the audience for and performance of our projects. We've worked extensively with them over the past few months to clean up their definition. We want to be inclusive and careful in defining the different Media titles ([M]), Channels ([C]), and Subchannels ([S]) so we can see what's happening with the different projects. That said, we don't need to be exhaustive and capture every single domain name: a domain that automatically redirects to one of our other sites ends up being counted after the redirect. If you see other changes we should request, start a thread on the Talk page.
Below is comScore's definition as of March 2009, which includes some still experimental efforts to identify edit pages:
[P] Wikimedia Foundation Sites
- [M] WIKIBOOKS.ORG
- %.WIKIBOOKS.COM%
- %.WIKIBOOKS.ORG%
- [C] Wikibooks Edit Pages
- %.WIKIBOOKS.ORG%&ACTION=EDIT%
- [M] Wikimedia Commons
- %.COMMONS.WIKIMEDIA.ORG%
- COMMONS.WIKIMEDIA.ORG%
- [C] Wikimedia Commons Edit Pages
- %.COMMONS.WIKIMEDIA.ORG%&ACTION=EDIT%
- COMMONS.WIKIMEDIA.ORG%&ACTION=EDIT%
- [M] Wikimedia Community Sites
- [C] MEDIAWIKI.ORG
- %.MEDIAWIKI.ORG%
- [C] WIKIMEDIA.AT
- %.WIKIMEDIA.AT%
- [C] WIKIMEDIA.CH
- %.WIKIMEDIA.CH%
- [C] WIKIMEDIA.DE
- %.WIKIMEDIA.DE%
- [C] WIKIMEDIA.FR
- %.WIKIMEDIA.FR%
- [C] WIKIMEDIA.IN
- %.WIKIMEDIA.IN%
- [C] WIKIMEDIA.IT
- %.WIKIMEDIA.IT%
- [C] WIKIMEDIA.NO
- %.WIKIMEDIA.NO%
- [C] WIKIMEDIA.ORG%
- %.WIKIMEDIA.COM%
- %.WIKIMEDIA.ORG%
- [S] Wikimedia Meta-Wiki Edit Pages
- %.META.WIKIMEDIA.ORG%&ACTION=EDIT%
- META.WIKIMEDIA.ORG%&ACTION=EDIT%
- [C] WIKIMEDIA.ORG.AR
- %.WIKIMEDIA.ORG.AR%
- [C] WIKIMEDIA.ORG.AU
- %.WIKIMEDIA.ORG.AU%
- [C] WIKIMEDIA.ORG.CO
- %.WIKIMEDIA.ORG.CO%
- [C] WIKIMEDIA.ORG.IN
- %.WIKIMEDIA.ORG.IN%
- [C] WIKIMEDIA.ORG.UK
- %.WIKIMEDIA.ORG.UK%
- [C] WIKIMEDIA.ORG.VE
- %.WIKIMEDIA.ORG.VE%
- [C] WIKIMEDIA.RO
- %.WIKIMEDIA.RO%
- [C] WIKIMEDIA.TW
- %.WIKIMEDIA.TW%
- [C] WIKIMEDIA.WEB.ID
- %.WIKIMEDIA.WEB.ID%
- [C] WIKIMEDIAFOUNDATION.ORG
- %.WIKIMEDIAFOUNDATION.COM%
- %.WIKIMEDIAFOUNDATION.ORG%
- [C] WIKIMEDIAHK.ORG
- %.WIKIMEDIAHK.ORG%
- [C] WMNL.NL
- %.WMNL.NL%
- [M] WIKINEWS.ORG
- %.WIKINEWS.ORG%
- [C] Wikinews Edit Pages
- %.WIKINEWS.ORG%&ACTION=EDIT%
- [M] Wikipedia International Portals
- [C] WIKIPEDIA.AT
- %.WIKIPEDIA.AT%
- [C] WIKIPEDIA.BE
- %.WIKIPEDIA.BE%
- [C] WIKIPEDIA.CH
- %.WIKIPEDIA.CH%
- [C] WIKIPEDIA.DE
- %.WIKIPEDIA.DE%
- [C] WIKIPEDIA.DK
- %.WIKIPEDIA.DK%
- [C] WIKIPEDIA.FR
- %.WIKIPEDIA.FR%
- [C] WIKIPEDIA.IT
- %.WIKIPEDIA.IT%
- [M] WIKIPEDIA.ORG
- %.WIKIPEDIA.COM%
- %.WIKIPEDIA.CZ%
- %.WIKIPEDIA.HU%
- %.WIKIPEDIA.NL%
- %.WIKIPEDIA.NO%
- %.WIKIPEDIA.ORG%
- %.WIKIPEDIA.PL%
- %.WIKIPEDIA.SE%
- [C] Arabic Wikipedia
- AR.WIKIPEDIA.ORG%
- [C] Chinese Wikipedias
- %.WIKIPEDIA.TW%
- %.WUU.WIKIPEDIA.ORG%
- %.ZH-CLASSICAL.WIKIPEDIA.ORG%
- %.ZH-MIN-NAN.WIKIPEDIA.ORG%
- %.ZH-YUE.WIKIPEDIA.ORG%
- WUU.WIKIPEDIA.ORG%
- ZH-CLASSICAL.WIKIPEDIA.ORG%
- ZH-MIN-NAN.WIKIPEDIA.ORG%
- ZH-YUE.WIKIPEDIA.ORG%
- ZH.WIKIPEDIA.ORG%
- [C] English Wikipedia
- EN.WIKIPEDIA.ORG%
- [C] French Wikipedia
- FR.WIKIPEDIA.ORG%
- [C] German Wikipedia
- DE.WIKIPEDIA.ORG%
- [C] Indian Wikipedias
- %.AS.WIKIPEDIA.ORG%
- %.BH.WIKIPEDIA.ORG%
- %.BN.WIKIPEDIA.ORG%
- %.BPY.WIKIPEDIA.ORG%
- %.GU.WIKIPEDIA.ORG%
- %.HI.WIKIPEDIA.ORG%
- %.KN.WIKIPEDIA.ORG%
- %.KS.WIKIPEDIA.ORG%
- %.ML.WIKIPEDIA.ORG%
- %.MR.WIKIPEDIA.ORG%
- %.NE.WIKIPEDIA.ORG%
- %.NEW.WIKIPEDIA.ORG%
- %.OR.WIKIPEDIA.ORG%
- %.PJ.WIKIPEDIA.ORG%
- %.SA.WIKIPEDIA.ORG%
- %.SD.WIKIPEDIA.ORG%
- %.TA.WIKIPEDIA.ORG%
- %.TE.WIKIPEDIA.ORG%
- %.UR.WIKIPEDIA.ORG%
- AS.WIKIPEDIA.ORG%
- BH.WIKIPEDIA.ORG%
- BN.WIKIPEDIA.ORG%
- BPY.WIKIPEDIA.ORG%
- GU.WIKIPEDIA.ORG%
- HI.WIKIPEDIA.ORG%
- KN.WIKIPEDIA.ORG%
- KS.WIKIPEDIA.ORG%
- ML.WIKIPEDIA.ORG%
- MR.WIKIPEDIA.ORG%
- NE.WIKIPEDIA.ORG%
- NEW.WIKIPEDIA.ORG%
- OR.WIKIPEDIA.ORG%
- PJ.WIKIPEDIA.ORG%
- SA.WIKIPEDIA.ORG%
- SD.WIKIPEDIA.ORG%
- TA.WIKIPEDIA.ORG%
- TE.WIKIPEDIA.ORG%
- UR.WIKIPEDIA.ORG%
- [C] Italian Wikipedia
- %.IT.WIKIPEDIA.ORG%
- IT.WIKIPEDIA.ORG%
- [C] Japanese Wikipedia
- %.JA.WIKIPEDIA.ORG%
- %.WIKIPEDIA.JP%
- JA.WIKIPEDIA.ORG%
- [C] Javanese Wikipedia
- %.JV.WIKIPEDIA.ORG%
- JV.WIKIPEDIA.ORG%
- [C] Korean Wikipedia
- %.KO.WIKIPEDIA.ORG%
- KO.WIKIPEDIA.ORG%
- [C] Portugese Wikipedia
- %.PT.WIKIPEDIA.ORG%
- %.WIKIPEDIA.ORG.BR%
- PT.WIKIPEDIA.ORG%
- [C] Russian Wikipedia
- %.WIKIPEDIA.SU%
- RU.WIKIPEDIA.ORG%
- [C] Spanish Wikipedia
- ES.WIKIPEDIA.ORG%
- [C] Vietnamese Wikipedia
- %.VI.WIKIPEDIA.ORG%
- VI.WIKIPEDIA.ORG%
- [C] Wikipedia Edit Pages
- %.WIKIPEDIA.ORG%&ACTION=EDIT%
- [S] English Wikipedia Edit Pages
- EN.WIKIPEDIA.ORG%&ACTION=EDIT%
- [C] Wikipedia.org Homepage
- WWW.WIKIPEDIA.ORG/
- [M] WIKIQUOTE.ORG
- %.WIKIQUOTE.COM%
- %.WIKIQUOTE.ORG%
- [C] Wikiquote Edit Pages
- %.WIKIQUOTE.ORG%&ACTION=EDIT%
- [M] WIKISOURCE.ORG
- %.WIKISOURCE.COM%
- %.WIKISOURCE.ORG%
- [C] Wikisource Edit Pages
- %.WIKISOURCE.ORG%&ACTION=EDIT%
- [M] Wikispecies
- %.SPECIES.WIKIMEDIA.ORG%
- SPECIES.WIKIMEDIA.ORG%
- [C] Wikispecies Edit Pages
- SPECIES.WIKIMEDIA.ORG%&ACTION=EDIT%
- [M] WIKIVERSITY.ORG
- %.WIKIVERSITY.COM%
- %.WIKIVERSITY.ORG%
- [C] Wikiversity Edit Pages
- %.WIKIVERSITY.ORG%&ACTION=EDIT%
- [M] WIKTIONARY.ORG
- %.WIKTIONARY.COM%
- %.WIKTIONARY.ORG%
- [C] Wiktionary Edit Pages
- %.WIKTIONARY.ORG%&ACTION=EDIT%
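To get a feel for how these patterns might classify a URL, here is a minimal Python sketch. It rests on assumptions that comScore has not confirmed here: that "%" is a wildcard matching any run of characters (as in SQL LIKE), that matching is case-insensitive, and that the pattern is compared against the whole URL with the scheme stripped. The helper names are made up for this example.

```python
import re


def pattern_to_regex(pattern):
    """Translate a '%'-wildcard pattern into a compiled regex (assumed semantics)."""
    literal_parts = [re.escape(part) for part in pattern.split("%")]
    return re.compile(".*".join(literal_parts), re.IGNORECASE)


def matches(pattern, url):
    """Return True if the whole URL (scheme already stripped) fits the pattern."""
    return pattern_to_regex(pattern).fullmatch(url) is not None


# Patterns copied from the definition above, paired with illustrative URLs.
EXAMPLES = [
    ("English Wikipedia", "EN.WIKIPEDIA.ORG%", "en.wikipedia.org/wiki/Main_Page"),
    ("Wikibooks Edit Pages", "%.WIKIBOOKS.ORG%&ACTION=EDIT%",
     "en.wikibooks.org/w/index.php?title=Cookbook&action=edit"),
    ("Wikimedia Commons", "%.COMMONS.WIKIMEDIA.ORG%", "commons.wikimedia.org/wiki/Main_Page"),
    ("Wikimedia Commons", "COMMONS.WIKIMEDIA.ORG%", "commons.wikimedia.org/wiki/Main_Page"),
    ("Wikipedia.org Homepage", "WWW.WIKIPEDIA.ORG/", "www.wikipedia.org/"),
]

for label, pattern, url in EXAMPLES:
    print(f"{label}: {pattern} vs {url} -> {matches(pattern, url)}")
```

Under these assumptions a pattern with a leading "%." does not match a bare host such as commons.wikimedia.org, which would explain why several entries list both the "%.HOST%" and "HOST%" forms.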