Wikipedia:Wikipedia Signpost/2010-09-13/Sister projects: Difference between revisions

<noinclude>{{Wikipedia:Signpost/Template:Signpost-header||Opinion|}}</noinclude>
 
{{Wikipedia:Signpost/Template:Signpost-article-start|{{{1|Biography bloopers: dead or alive?}}}|By [[User:WereSpielChequers|WereSpielChequers]]|7 September 2010}}
 
Just over a month ago, ''The Signpost'' [[Wikipedia:Wikipedia Signpost/2010-07-26/News and notes|published a story]] on the [[meta:Death anomalies table|Death Anomalies project]], which identifies cases where different language versions of Wikipedia disagree as to whether an individual is dead or alive. The project was started in June of this year, and at the time the story was published, only the German and English Wikipedias were actively extracting reports of anomalies. Since then, the Latin, Swedish, and Slovenian Wikipedias have joined in, and hundreds of errors and anomalies have been resolved. When ''The Signpost'' covered the project, a number of readers pitched in, and the number of anomalies on the English Wikipedia report was slashed from 447 to 190 in just over a week. The English Wikipedia still has more than 100 anomalies on [[Wikipedia:Database reports/Living people on EN wiki who are dead on other wikis]], with new reports coming in daily. However, most of the backlog is down to differences in the way different projects treat missing people, people who (if alive) would be more than 100 years old, cross-wiki anomalies stemming from an unreferenced article showing a person as dead, and issues that probably require a native or foreign-language speaker to resolve.
 
In July, only two projects were extracting data from the table, though it queried data from around 70. These have since been joined by the Swedish Wikipedia, [http://sv.wikipedia.org/w/index.php?title=Wikipedia:Projekt_levande_personer/Eventuellt_avlidna&action=history which rapidly reduced 94 anomalies to 16], and the Latin Wikipedia, which has managed to [http://la.wikipedia.org/w/index.php?title=Vicipaedia:Mortui_dicti&action=history reduce its anomalies to one]. So far this month, the [[:sl:Wikipedija:Biografije živečih oseb/Domnevno umrli|Slovene Wikipedia]] has become the fifth participating project.
 
Biographies of living people (BLPs) inevitably need to be updated when the subject dies, so all these reports are expected to be ongoing maintenance tasks. Although the bot is processing data from millions of biographies across different language Wikipedias, fewer than a thousand anomalies have been identified so far. The process relies on [[Interwiki links]] and categories that identify biographies as either dead or living. Some projects are ineligible for the program because they don't organise their articles in such a way; for example, the Portuguese Wikipedia has lists of people who died in particular years (rather than categories).
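
As a rough illustration of the comparison described above, the following is a minimal sketch and not the bot's actual code. It assumes the interwiki links have already been followed and each wiki's categories reduced to a status of "living", "dead" or "unknown"; the function name <code>find_anomalies</code>, the input layout and the status strings are all hypothetical.

```python
from typing import Dict, List

# Hypothetical input layout: person -> {language code -> status}.
# The status is whatever each wiki's categories imply for the article
# reached via interwiki links: "living", "dead" or "unknown".
Statuses = Dict[str, str]


def find_anomalies(people: Dict[str, Statuses], home: str) -> Dict[str, List[str]]:
    """Return people categorised as living on `home` but dead elsewhere.

    The result maps each such person to the sorted list of language
    codes whose categories mark that person as dead.
    """
    anomalies: Dict[str, List[str]] = {}
    for name, statuses in people.items():
        # Only biographies that the home wiki treats as living are reported.
        if statuses.get(home) != "living":
            continue
        dead_on = sorted(
            lang for lang, status in statuses.items()
            if lang != home and status == "dead"
        )
        if dead_on:
            anomalies[name] = dead_on
    return anomalies
```

A wiki that records deaths in lists rather than categories would yield only "unknown" statuses under this scheme, which is why such a project could neither receive nor contribute reports in this sketch.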
 
In the future, the number of languages from which data is extracted and the number of languages requesting reports will hopefully increase; we have 66 Wikipedia language versions, including French, Spanish, Japanese, Polish and Russian, for which reports could be extracted almost immediately. [[User:Merlissimo|Merlissimo]] has a bot that updates the reports daily, and is willing to produce reports for other projects.
 
=== User responses ===
{{cquote|The '''Swedish Wikipedia''' is fertile ground for a project of this kind. After some years of rapid growth in the number of articles, attention swung to quality and structure in 2008. Biographic articles were [[:sv:Kategori:Personer efter kön|exhaustively categorized by gender]] in the [northern autumn] of 2008, revealing that there are four male biographies for each female one, and by years of birth and death in 2009. This is also when the [[:sv:Kategori:Levande personer|category for living people]] was created and a [[:sv:Wikipedia:Projekt levande personer|WikiProject for living people]] was started. The "death anomalies" report was set up as a subpage to this WikiProject, named "[[:sv:Wikipedia:Projekt levande personer/Eventuellt avlidna|possibly deceased]]" people. Of course there are contributors who write new articles, but there is also an active community of users who categorize and verify the information ... The Swedish Wikipedia has also benefited from [http://toolserver.org/~sk/cw/index.htm Check Wikipedia], a daily report of wiki-syntax errors, and would welcome similar projects. --[[User:LA2|LA2]] ([[User talk:LA2|talk]]) 20:06, 7 September 2010 (UTC)}}
 
{{cquote|Although the '''Latin Wikipedia''' ([[:la:|la.wikipedia]]) uses a language with a long history, a large portion of its articles cover modern topics, including (of course) BLPs ... In figures: [Of the] about 44,000 articles available in the Latin Wikipedia today, about 4,300 (or roughly ten percent) are BLPs. The death anomalies table adds an extra level of reliability to BLPs on the [English, German, Swedish and Latin Wikipedias]. It is great to see that more and more tools are available that permit semantic checks and analyses of information ... the future is not just isolated wikitext articles, but a flexible repository of semantic information. The death anomalies table shows a glimpse of what might be possible in the future, when we will have at our disposal not only (wiki)text but also rich, usefully structured information and data. A big thanks to all the volunteers (including the [[Wikipedia:Wikipedia Signpost/2010-07-26/Sister projects#Feedback|rock star]]) who make this possible! [[User:UV|UV]]}}
 
{{cquote|The '''Slovenian Wikipedia''' has a relatively large proportion of biographies, of which more than 8,000 are BLPs (almost 10% of the total article count). Many of those articles have been added semi-automatically and we have a small community of active contributors. Consequently, [many articles] aren't regularly maintained, which is why this tool will certainly prove extremely useful for easing the burden of keeping the content up-to-date. This means less work when the focus shifts from adding content to improving the quality one day, and improved reliability of the work until then. Thanks to all the developers in the name of the Slovenian Wikipedia community. -- [[User:Yerpo|Yerpo]] <sup>[[User talk:Yerpo|Eh?]]</sup> 08:19, 12 September 2010 (UTC)}}
 
{{cquote|The '''German Wikipedia''' has more than 340,000 articles about people that include [[:de:Hilfe:Personendaten|machine-readable data]] usable by external projects. The local report covers all people (not only living people) and is forwarded to 150 WikiProjects filtered by subject area. The script runs on the toolserver and uses the [[tswiki:Batch job scheduling|sun grid engine]] for efficient resource handling. About 1.9 million interwiki relations are checked every day for creating reports on five Wikipedias. [[User:Merlissimo/Sig|Merl]][[User talk:Merlissimo/Sig|issimo]]}}
 
<noinclude>{{Wikipedia:Signpost/Template:Signpost-article-comments-end||2010-09-06|2010-09-20}}</noinclude>

Revision as of 10:36, 13 September 2010