Research:Which parts of an article do readers read
|This page is currently a draft. More information pertaining to this may be available on the talk page.
Translation admins: Normally, drafts should not be marked for translation.
This research topic page provides an overview of results, methods and data sources pertaining to the question "Which parts of a Wikipedia article[notes 1] do users actually view?" .
A 2015 study of clickstream data (on desktop) from the English Wikipedia found that the rate at which wikilinks (internal links) are clicked decreases from top to bottom of the page, although more gradually than one might expect (see chart).
Other researchers who looked at the same data estimated that wikilinks located in the lead section receive between 26% and 43% of the clicks on wikilinks,. A follow-up study found that although the lead and the infobox contain only 17% and 4% of the links of an article, they receive 32% and 18% of clicks, respectively . Links located on the left side of the screen (assuming a browser window that takes up the entire screen on a standard WUXGA display on desktop) are more likely to be clicked.
On the mobile web version of Wikipedia, sections below the lede are by default collapsed (on devices below a certain screen size, i.e. on smartphones but not on tablets). The readers needs to tap on the section heading to read its content. These actions are counted - for a small sample of readers - in the MobileWebSectionUsage schema.
39.9% of the non-tablet mobile users who viewed a mainspace page on November 30 opened a section there, i.e. the median number of sections opened was 0.
(This refers to instrumentations that recorded which parts of a page appeared in the viewable area.)
On the Android Wikipedia app, around 68% of pageviews involve the reader scrolling down at least once (June 2017, excludes navigation via TOC).
During one week in 2014, 25% of app users (devices) scrolled to the end of a page at least once.
German thesis: E.g. in "lookup" tasks, readers spend >45% of time on scanning TOC and lists ("QL-LI"), in "learn" tasks it's <10%
Related German paper: "To get insights into users' interaction with pictorial and textual contents eye-tracking experiments are conducted. Spread of information within the articles and the relation between text and images are analyzed. ... By now 30 articles have been analyzed according to this scheme. There are 639 contact points leading to images. Results show that 39% of all contact points lead from image to image, in mutual directions (previous or next). All text contact points (T, TC, TB, TE, TN, Cit) sum up to a total of 37%. In 5% of all cases, an introduction triggers a saccade to an image. The remaining types of contact points occur rather rarely."
A 2012 conference paper by four researchers from Scotland, titled "Looking for genre: the use of structural features during search tasks with Wikipedia" described the results of an eye tracking study with 30 participants asked to carry out various research tasks on Wikipedia. A main finding was that readers tended to look first at the table of contents, then at the article's infobox. More generally, they "extensively interacted with layout features, such as tables, titles, bullet lists, contents lists, information boxes, and references", and were also observed to frequently "skim and scroll" long articles.
See also: Demo video of an affordable eyetracking system used on a Wikipedia article (2018)
The page previews feature was introduced in 2017/18 on desktop Wikipedia. Reader can hover their mouse over a link to see an excerpt of the linked page. An internal dataset contains aggregated numbers on how many previews were viewed for a given link, which (similarly to the clickstream data mentioned above) can be used to generate a heatmap of hovers that indicates which parts of the page were read, but also which topics(links) readers are most interested in looking up briefly from the source page.
- Ashwin Paranjape, Bob West, Jure Leskovec, Leila Zia: Improving Website Hyperlink Structure Using Server Logs. WSDM’16, February 22–25, 2016, San Francisco, CA, USA. PDF
- Lamprecht, Daniel; Helic, Denis; Strohmaier, Markus (2015-04-22). "Quo Vadis? On the Effects of Wikipedia's Policies on Navigation". Ninth International AAAI Conference on Web and Social Media. Ninth International AAAI Conference on Web and Social Media.
- Lamprecht, Daniel; Lerman, Kristina; Helic, Denis; Strohmaier, Markus (May 2016). "How the structure of Wikipedia articles influences user navigation". New Review of Hypermedia and Multimedia. doi:10.1080/13614568.2016.1179798. Retrieved December 15, 2016.
- Dimitrov, Dimitar; Singer, Philipp; Lemmerich, Florian; Strohmaier, Markus (2016-04-11). Visual Positions of Links and Clicks on Wikipedia (PDF). 25TH INTERNATIONAL WORLD WIDE WEB CONFERENCE. Montréal, Québec, Canada. p. 2.
-  "25% of install base saw at least one read more panel", meaning that these app users read or scrolled to the end of an article at least once (where these panels are located).
- Knäusl, Hanna (2014-12-18). "Situationsabhängige Rezeption von Information bei Verwendung der Wikipedia" (Thesis of the University of Regensburg). p. 202 (in German, with English abstract), cf. 2012 poster
- Rösch, Barbara (2014). "Investigation of Information Behavior in Wikipedia Articles". Proceedings of the 5th Information Interaction in Context Symposium. IIiX '14. New York, NY, USA: ACM. pp. 351–353. ISBN 978-1-4503-2976-7. doi:10.1145/2637002.2637062.
- Clark, Malcolm; Ruthven, Ian; O’Brian Holt, Patrik and Song, Dawei (2012). Looking for genre: the use of structural features during search tasks with Wikipedia. Fourth Information Interaction in Context Conference (IIiX 2012). DOI • PDF
- (or other pages on Wikimedia projects)