
Wikipedia:Wikipedia Signpost/2024-04-25/Recent research: Difference between revisions

From Wikipedia, the free encyclopedia
Revision as of 19:01, 23 April 2024

Recent research

New survey of over 100,000 Wikipedia users


A monthly overview of recent academic research about Wikipedia and other Wikimedia projects, also published as the Wikimedia Research Newsletter.


Survey dataset of over 100,000 Wikipedia readers and contributors

From the abstract:[1]

"The dataset focuses on Wikipedia users and contains information about demographic and socioeconomic characteristics of the respondents and their activity on Wikipedia. The data was collected using a questionnaire available online between June and July 2023. The link to the questionnaire was distributed via a banner published in 8 languages on the Wikipedia page. [...] The survey includes 200 questions about: what people were doing on Wikipedia before clicking the link to the questionnaire; how they use Wikipedia as readers ('professional' and 'personal' uses); their opinion on the quality, the thematic coverage, the importance of the encyclopaedia; the making of Wikipedia (how they think it is made, if they have ever contributed and how); their social, sport, artistic and cultural activities, both online and offline; their socio-economic characteristics including political beliefs, and trust propensities. More than 200 000 people opened the questionnaire, 100 332 started to answer, and constitute our dataset, and 10 576 finished it."
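To put the response counts in the abstract into proportion, here is a minimal sketch of the arithmetic. It uses only the three figures quoted above; since the abstract says "more than 200 000" people opened the questionnaire, that count is treated as a lower bound, so the derived start rate is an upper bound. The variable and function names are illustrative only.

```python
# Counts quoted in the abstract above.
opened_at_least = 200_000  # "More than 200 000" opened it: a lower bound
started = 100_332          # respondents who started answering (the dataset)
finished = 10_576          # respondents who completed the questionnaire


def share(part: int, whole: int) -> float:
    """Return part/whole as a percentage."""
    return 100 * part / whole


# Upper bound, because the true number of people who opened is larger.
start_rate_upper = share(started, opened_at_least)

# Share of those who started the questionnaire that also finished it.
completion_rate = share(finished, started)

print(f"Started (of those who opened): at most {start_rate_upper:.1f}%")
print(f"Finished (of those who started): {completion_rate:.1f}%")
```

In other words, roughly one in ten of the respondents in the dataset completed all questions, which matters for any analysis of items that appear late in the questionnaire.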

This dataset paper doesn't contain any results from the survey itself. From the communications around it (including the project's page on Meta-wiki at Research:Surveying readers and contributors to Wikipedia), it is not clear whether and when the authors or others are planning to publish any analyses. Hence we are taking a quick look ourselves at some topline results below (note: these are taken directly from the "filtered" dataset published by the authors, without any weighting by language or other debiasing efforts). It is to be hoped that more use will be made of this data soon, especially since various questions appear to have been designed for compatibility with certain previous surveys.

These gender ratios are notably somewhat more balanced than, e.g., the figures from the Wikimedia Foundation's "Community Insights" surveys of recent years; however, those targeted a different population consisting exclusively of contributors. Still, the gender gap in this new survey data is even somewhat smaller than that found for English-language Wikipedia readers in a past survey by the Wikimedia Foundation (cf. below).

Distribution of responses to the question "In political matters, people talk of 'the left' and 'the right.' How would you place your views on this scale, generally speaking?" (NB: 11.7% of those who responded chose the option "This distinction does not speak to you").

Unless we are dealing with a data anomaly here, this chart shows a general preponderance of left-of-center political positions among Wikipedia users, partly balanced out by a substantial share of far-right users (10 on a scale from 1 = left to 10 = right).

Briefly

  • The Wikimedia Foundation invites feedback on a whitepaper about "Wikimedia Research Best Practices Around Privacy" (until April 30), see also News and notes in this Signpost issue
  • The Wikimedia Foundation's research department invites proposals (deadline: April 29) for the "Wiki Workshop Hall", a new feature of the annual Wiki Workshop online conference consisting of two 30-minute sessions "for Wikimedia researchers and Wikimedia movement members to connect with each other."
  • See the page of the monthly Wikimedia Research Showcase for videos and slides of past presentations.

Other recent publications

Other recent publications that could not be covered in time for this issue include the items listed below. Contributions, whether reviewing or summarizing newly published research, are always welcome.

"Global Gender Differences in Wikipedia Readership"

"Wikipedia reader gender by language" (from 2019 survey data)

From the abstract and introduction:[2]

"From a global online survey of 65,031 readers of Wikipedia and their corresponding reading logs, we present first evidence of gender differences in Wikipedia readership and how they manifest in records of user behavior. More specifically we report that (1) women are underrepresented among readers of Wikipedia, (2) women view fewer pages per reading session than men do, (3) men and women visit Wikipedia for similar reasons, and (4) men and women exhibit specific topical preferences"
"Across 16 surveys, men represent approximately two-thirds of Wikipedia readers on any given day. Additionally, we observe that women view fewer pages per reading session than men do. However, we also find that on average, men and women visit Wikipedia for similar reasons. That is, the depth of knowledge that they seek, referred to as information need for the remainder of this paper, and their triggers for reading Wikipedia, referred to as motivations, are remarkably similar. Finally, men and women exhibit specific topical preferences. Readership of articles about sports, games, and mathematics is skewed towards men, while readership of articles about broadcasting, medicine, and entertainment is skewed towards women. We further observe evidence of self-focus bias[...], i.e. that women tend to read relatively more biographies of women than men do, whereas men tend to read relatively more biographies of men than women do."
"closing content gaps is not a panacea as evidenced by prior research on Welsh Wikipedia, where a majority of the biographies are about women [...], a majority of Welsh speakers are women,[...] but readership is still heavily skewed towards men"

See also project page on Meta-wiki: m:Research:Characterizing_Wikipedia_Reader_Behaviour/Demographics_and_Wikipedia_use_cases and a subsequent literature review which formulated various potential explanations for the observed gender gap in Wikipedia readers.


"Hunters, busybodies and the knowledge network building associated with deprivation curiosity"

From the abstract:[3]

"A recently developed historicophilosophical taxonomy of curious practice distinguishes between the collection of disparate, loosely connected pieces of information and the seeking of related, tightly connected pieces of information. With this taxonomy, we use a novel knowledge network building framework of curiosity to capture styles of curious information seeking in 149 participants as they explore Wikipedia for over 5 hours spanning 21 days. We create knowledge networks in which nodes consist of distinct concepts (unique Wikipedia pages) and edges represent the similarity between the content of Wikipedia pages. We quantify the tightness of each participants' knowledge networks using graph theoretical indices and use a generative model of network growth to explore mechanisms underlying the observed information seeking. We find that participants create knowledge networks with small-world and modular structure. Deprivation sensitivity, the tendency to seek information that eliminates knowledge gaps, is associated with the creation of relatively tight networks and a relatively greater tendency to return to previously-visited concepts. We further show that there is substantial within-person variability in knowledge network building over time and that building looser networks than usual is linked with higher than usual sensation seeking."
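To illustrate what "tightness" of a knowledge network can mean in practice, here is a minimal sketch that computes the average clustering coefficient of a small undirected graph. This is only one common graph-theoretical index; the abstract does not say which indices the authors used, so treat the choice as an assumption. The page names "A" to "D" and the helper functions are hypothetical.

```python
from itertools import combinations


def build_adj(edges):
    """Build an undirected adjacency map {node: set(neighbours)}."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def local_clustering(adj, node):
    """Fraction of a node's neighbour pairs that are themselves linked."""
    nbrs = adj[node]
    k = len(nbrs)
    if k < 2:
        return 0.0
    links = sum(1 for u, v in combinations(nbrs, 2) if v in adj[u])
    return 2 * links / (k * (k - 1))


def average_clustering(adj):
    """Mean local clustering over all nodes; higher means a tighter network."""
    return sum(local_clustering(adj, n) for n in adj) / len(adj)


# Hypothetical toy networks: nodes are visited pages, edges link similar pages.
tight = build_adj([("A", "B"), ("B", "C"), ("A", "C"), ("C", "D")])
loose = build_adj([("A", "B"), ("B", "C"), ("C", "D")])

print(average_clustering(tight), average_clustering(loose))
```

In this toy example, the first network contains a closed triangle of mutually similar pages and therefore scores higher than the simple chain, which loosely corresponds to the abstract's contrast between tight and loose information seeking.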

See also an explanatory Twitter thread by one of the authors


"Architectural styles of curiosity in global Wikipedia mobile app readership"

From the abstract:[4]

"[...] most curiosity research relies on small, Western convenience samples. Here, we expand an analysis of a laboratory study with 149 participants browsing Wikipedia to 482,760 readers using Wikipedia's mobile app in 14 languages from 50 countries or territories. By measuring the structure of knowledge networks constructed by readers weaving a thread through articles in Wikipedia, we provide the first replication of two distinctive architectural styles of curiosity: that of the busybody and of the hunter [in reference to the above paper involving some of the same authors ...] Finally, across languages and countries, we identify novel associations between the structure of knowledge networks and population-level indicators of spatial navigation, education, mood, well-being, and inequality."

See also research project page on Meta-wiki: m:Research:Understanding Curious and Critical Readers


"Quantifying knowledge synchronization [between Wikipedia language versions] with the network-driven approach"

From the paper:[5]

"[...] we explore the dominant path of knowledge diffusion in the 21st century using Wikipedia, the largest communal dataset. We evaluate the similarity of shared knowledge between population groups, distinguished based on their language usage. When population groups are more engaged with each other, their knowledge structure is more similar, where engagement is indicated by socio-economic connections, such as cultural, linguistic, and historical features. Moreover, geographical proximity is no longer a critical requirement for knowledge dissemination.
We used Wikipedia SQL dump of 59 different language editions on February 1, 2019. [...] Specifically, we used two collections of the Wikipedia dump: category membership link records (*-categorylinks.sql.gz) and interlanguage link records (*-langlinks.sql.gz). [...] From the linkage between Wikipedia pages and categories, we extracted a hierarchical knowledge network of each language edition. [...Based on these per-language structures] we constructed the similarity network from the pairwise knowledge structure similarity, where nodes represent the language of Wikipedia, and the link's weight indicates similarity between languages."
"English is in the center and serves as a hub node, while intermediate hub languages such as Spanish, German, French, Russian, Portuguese, Chinese, and Dutch also function as cluster centroids"
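The construction of such a similarity network can be sketched as follows. Note that the quoted passage does not specify how the authors computed the pairwise "knowledge structure similarity"; this sketch substitutes plain Jaccard similarity over sets of (category, subcategory) links as a stand-in, and the toy structures are invented for illustration rather than taken from any Wikipedia dump.

```python
from itertools import combinations


def jaccard(a: set, b: set) -> float:
    """Jaccard similarity of two sets (a stand-in similarity measure)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def similarity_network(structures: dict) -> dict:
    """Weighted network: nodes are languages, edge weights are similarities."""
    return {
        (x, y): jaccard(structures[x], structures[y])
        for x, y in combinations(sorted(structures), 2)
    }


# Invented toy knowledge structures, already aligned across languages
# (in the paper, alignment would rely on the interlanguage link records).
toy = {
    "en": {("Science", "Physics"), ("Science", "Biology"), ("Arts", "Music")},
    "de": {("Science", "Physics"), ("Science", "Biology")},
    "ko": {("Arts", "Music")},
}

net = similarity_network(toy)
for (a, b), weight in net.items():
    print(a, b, round(weight, 3))
```

In this toy setup "en" shares links with both other editions, while "de" and "ko" share none, so "en" ends up as the only node connecting the other two, a miniature version of a hub.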


Despite teachers' skepticism, 86% of Estonian high school students use Wikipedia at least a couple of times per month (female students more often)

From the abstract:[6]

"The article is based on a quantitative study in which 381 Estonian school children [9th and 12th grade students] participated in filling out an online survey. The questionnaire included both multiple-choice and open-ended questions. Findings: Statistical analyses and responses to open-ended questions showed that students often use Wikipedia as a primary source of information, but that their use of the site for learning tasks is guided by teachers’ attitudes and perceptions towards Wikipedia. Students perceive Wikipedia as a quick and convenient source of information but are uncertain about its reliability."

From the "Results" section:

"[...] 5% of the students surveyed use Wikipedia every day, 51% at least a couple of times a week and 30% a couple of times a month. To compare the groups, we conducted a t-test, which concluded that statistically significant differences were present across gender and grades. For the purpose of the calculations, we treated responses as numerical (rarely/not at all = 1, a few times a year = 2, a few times a month = 3, a few times a week = 4, every day = 5). For gender, the mean is 3.73 for women and 3.46 for men (p < 0.05). Thus, there is a statistically significant difference in the frequency of Wikipedia use between the two groups, with female students using Wikipedia more often than male students. [...] 24% of the students surveyed said that teachers had no objection to using Wikipedia, 3% said that teachers did not allow to use Wikipedia, 47% said that some teachers did and some did not and 10% said that they did not know. Teachers do not explicitly forbid students from using Wikipedia for learning tasks, but they do recommend that students use more trustworthy sources [...]"
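The numeric coding described in the quote, and a two-sample comparison built on it, can be sketched as follows. The quoted passage only says "a t-test", so the use of Welch's variant (unequal variances) here is an assumption, and the responses below are made-up toy values, not data from the study.

```python
from math import sqrt
from statistics import mean, variance

# Numeric coding of the answer options, as described in the quoted passage.
CODES = {
    "rarely/not at all": 1,
    "a few times a year": 2,
    "a few times a month": 3,
    "a few times a week": 4,
    "every day": 5,
}


def encode(responses):
    """Map answer labels to their numeric codes."""
    return [CODES[r] for r in responses]


def welch_t(a, b):
    """Welch's t statistic for two independent samples (assumed variant)."""
    se = sqrt(variance(a) / len(a) + variance(b) / len(b))
    return (mean(a) - mean(b)) / se


# Made-up toy responses for two groups, NOT taken from the survey.
group_a = encode(["a few times a week", "every day",
                  "a few times a month", "a few times a week"])
group_b = encode(["a few times a month", "a few times a year",
                  "a few times a week", "a few times a month"])

t = welch_t(group_a, group_b)
print(mean(group_a), mean(group_b), round(t, 3))
```

Treating ordinal answer categories as equally spaced numbers is itself a modelling choice; a rank-based test would be a common alternative for this kind of data.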


"With or without Wikipedia? Integrating Wikipedia into the Teaching Process in Estonian General Education Schools"

From the abstract:[7]

"The study is based on semi-structured interviews with 49 teachers from 11 general education schools in Estonia. The results of the qualitative content analysis of the interviews indicate that teachers consider the use of Wikipedia to be a suitable for teaching, alongside other information sources and environments. However, teachers acknowledge some uncertainty and caution towards Wikipedia, as they do not consider it a very reliable teaching tool: an attitude largely inherited from the early days of Wikipedia. While teachers themselves are active and frequent Wikipedia users, and allow students to search for information, they do not assign Wikipedia-based text-creation tasks to students."

References

  1. ^ Cruciani, Caterina; Joubert, Léo; Jullien, Nicolas; Mell, Laurent; Piccione, Sasha; Vermeirsche, Jeanne (2023-12-01). "Surveying Wikipedians: a dataset of users and contributors' practices on Wikipedia in 8 languages". arXiv:2311.07964. / Dataset: Cruciani, Caterina; Joubert, Léo; Jullien, Nicolas; Mell, Laurent; Piccione, Sasha; Vermeirsche, Jeanne (2023-12-01), Surveying Wikipedians: a dataset of users and contributors’ practices on Wikipedia in 8 languages, doi:10.34847/nkl.4ecf4u8m
  2. ^ Johnson, Isaac; Lemmerich, Florian; Sáez-Trumper, Diego; West, Robert; Strohmaier, Markus; Zia, Leila (2021-05-22). "Global Gender Differences in Wikipedia Readership". Proceedings of the International AAAI Conference on Web and Social Media. 15: 254–265. ISSN 2334-0770.
  3. ^ Lydon-Staley, David M.; Zhou, Dale; Blevins, Ann Sizemore; Zurn, Perry; Bassett, Danielle S. (2020-11-30). "Hunters, busybodies and the knowledge network building associated with deprivation curiosity". Nature Human Behaviour: 1–10. doi:10.1038/s41562-020-00985-7. ISSN 2397-3374. Earlier preprint: Lydon-Staley, David Martin; Zhou, Dale; Blevins, Ann Sizemore; Zurn, Perry; Bassett, Danielle S. (2019-06-08). Hunters, busybodies, and the knowledge network building associated with curiosity. PsyArXiv.
  4. ^ Zhou, Dale; Patankar, Shubhankar; Lydon-Staley, David Martin; Zurn, Perry; Gerlach, Martin; Bassett, Danielle S. (2023-11-02), Architectural styles of curiosity in global Wikipedia mobile app readership, PsyArXiv, doi:10.31234/osf.io/szuyj
  5. ^ Yoon, Jisung; Park, Jinseo; Yun, Jinhyuk; Jung, Woo-Sung (2023-11-01). "Quantifying knowledge synchronization with the network-driven approach". Journal of Informetrics. 17 (4): 101455. doi:10.1016/j.joi.2023.101455. ISSN 1751-1577.
  6. ^ Remmik, Marvi; Siiman, Ann; Reinsalu, Riina; Vija, Maigi; Org, Andrus (January 2024). "Using Wikipedia to Develop 21st Century Skills: Perspectives from General Education Students". Education Sciences. 14 (1): 101. doi:10.3390/educsci14010101. ISSN 2227-7102.
  7. ^ Reinsalu, Riina; Vija, Maigi; Org, Andrus; Siiman, Ann; Remmik, Marvi (June 2023). "With or without Wikipedia? Integrating Wikipedia into the Teaching Process in Estonian General Education Schools". Education Sciences. 13 (6): 583. doi:10.3390/educsci13060583. ISSN 2227-7102.

This page is a draft for the next issue of the Signpost. Below is some helpful code that will help you write and format a Signpost draft. If it's blank, you can fill out a template by copy-pasting this in and pressing 'publish changes': {{subst:Wikipedia:Wikipedia Signpost/Templates/Story-preload}}


Images and Galleries
Sidebar images

To put an image in your article, use the following template (link):

[[File:|center|300px|alt=TKTK]]

O frabjous day.
{{Wikipedia:Wikipedia Signpost/Templates/Filler image-v2
 |image     = 
 |size      = 300px
 |alt       = TKTK
 |caption   = 
 |fullwidth = no
}}

This will create the file on the right. Keep the 300px in most cases. If writing a 'full width' article, change |fullwidth=no to |fullwidth=yes.

Inline images

Placing

{{Wikipedia:Wikipedia Signpost/Templates/Inline image
 |image   =
 |size    = 300px
 |align   = center
 |alt     = Placeholder alt text
 |caption = CAPTION
}}

(link) will instead create an inline image like below

[[File:|300px|center|alt=Placeholder alt text]]
CAPTION
Galleries

To create a gallery, use the following

<gallery mode = packed | heights = 200px>
|Caption for second image
</gallery>

to create

Quotes
Framed quotes

To insert a framed quote like the one on the right, use this template (link):

{{Wikipedia:Wikipedia Signpost/Templates/Filler quote-v2
 |1         = 
 |author    = 
 |source    = 
 |fullwidth = 
}}

If writing a 'full width' article, change |fullwidth=no to |fullwidth=yes.

Pull quotes

To insert a pull quote like

use this template (link):

{{Wikipedia:Wikipedia Signpost/Templates/Quote
 |1         = 
 |source    = 
}}
Long quotes

To insert a long inline quote like

The goose is on the loose! The geese are on the lease!
— User:Oscar Wilde
— Quotations Notes from the Underpoop

use this template (link):

{{Wikipedia:Wikipedia Signpost/Templates/block quote
 | text   = 
 | by     = 
 | source = 
 | ts     = 
 | oldid  = 
}}
Side frames

Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur. Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt mollit anim id est laborum.

A caption

Side frames help put content in sidebar vignettes. For instance, this one (link):

{{Wikipedia:Wikipedia Signpost/Templates/Filler frame-v2
 |1         = Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur. Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt mollit anim id est laborum.
 |caption   = A caption
 |fullwidth = no
}}

gives the frame on the right. This is useful when you want to insert non-standard images, quotes, graphs, and the like.

Example − Graph/Charts
A caption

For example, to insert the {{Graph:Chart}} generated by

{{Graph:Chart
 |width=250|height=100|type=line
 |x=1,2,3,4,5,6,7,8|y=10,12,6,14,2,10,7,9
}}

in a frame, simply put the graph code in |1=

{{Wikipedia:Wikipedia Signpost/Templates/Filler frame-v2
 |1=
{{Graph:Chart
 |width=250|height=100|type=line
 |x=1,2,3,4,5,6,7,8|y=10,12,6,14,2,10,7,9
}}
 |caption=A caption
 |fullwidth=no
}}

to get the framed Graph:Chart on the right.

If writing a 'full width' article, change |fullwidth=no to |fullwidth=yes.

Two-column vs full width styles

If you keep the 'normal' preloaded draft and work from there, you will be using the two-column style. This is perfectly fine in most cases and you don't need to do anything.

However, every time you have a |fullwidth=no and change it to |fullwidth=yes (or vice-versa), the article will take that style from that point onwards (|fullwidth=yes → full width, |fullwidth=no → two-column). By default, omitting |fullwidth= is the same as putting |fullwidth=no and the article will have two columns after that. Again, this is perfectly fine in most cases, and you don't need to do anything.

However, you can also fine-tune which style is used at which point in an article.

To switch from two-column → full width style midway in an article, insert

{{Wikipedia:Wikipedia Signpost/Templates/Signpost-block-end-v2}}
{{Wikipedia:Wikipedia Signpost/Templates/Signpost-block-start-v2|fullwidth=yes}}

where you want the switch to happen.

To switch from full width → two-column style midway in an article, insert

{{Wikipedia:Wikipedia Signpost/Templates/Signpost-block-end-v2}}
{{Wikipedia:Wikipedia Signpost/Templates/Signpost-block-start-v2|fullwidth=no}}

where you want the switch to happen.

Article series

To add a series of 'related articles' to your article, use the following code

Related articles
Visual Editor

Five, ten, and fifteen years ago
1 January 2023

VisualEditor, endowment, science, and news in brief
5 August 2015

HTTPS-only rollout completed, proposal to enable VisualEditor for new accounts
17 June 2015

VisualEditor and MediaWiki updates
29 April 2015

Security issue fixed; VisualEditor changes
4 February 2015


More articles

{{Signpost series
 |type        = sidebar-v2
 |tag         = VisualEditor
 |seriestitle = Visual Editor
 |fullwidth   = no
}}

or

{{Signpost series
 |type        = sidebar-v2
 |tag         = VisualEditor
 |seriestitle = Visual Editor
 |fullwidth   = yes
}}

will create the sidebar on the right. If writing a 'full width' article, change |fullwidth=no to |fullwidth=yes. The |tag= parameter determines which articles are listed; a partial list of valid values can be found here. |seriestitle= is the title that will appear below 'Related articles' in the box.

Alternatively, you can use

{{Signpost series
 |type        = inline
 |tag         = VisualEditor
 |tag_name    = visual editor
 |tag_pretext = the
}}

at the end of an article to create

For more Signpost coverage on the visual editor see our visual editor series.

If you think a topic would make a good series but you don't see a tag for it, or if all the articles in a series seem 'old', ask for help at WT:NEWSROOM. Many more tags exist, but they haven't been documented yet.

Links and such

By the way, the template that you're reading right now is {{Editnotices/Group/Wikipedia:Wikipedia Signpost/Next issue}} (edit). A list of the preload templates for Signpost articles can be found here.