DBpedia: Difference between revisions

From Meta, a Wikimedia project coordination wiki
Content deleted Content added
cat
m →‎DBpedia Ontology: link disambiguation
 
(20 intermediate revisions by 12 users not shown)
Line 1: Line 1:
'''DBpedia''' is a community effort to extract structured information from [[Wikipedia]] and to make this information available on the Web. DBpedia allows you to ask sophisticated queries against Wikipedia and to interlink other datasets on the Web with DBpedia data.
'''dbpedia''' is a project advertised in January 2007 by Sören Auer (Universität Leipzig) et al. that digs out data from Wikipedia's articles and makes it searchable in an RDF metadata framework. It appears to deliver some of the promises of [[Wikidata]], albeit with a different technology.


Information about the DBpedia project and dataset can be found here:
==External links==
* [[:w:en:DBpedia|The DBpedia article on Wikipedia]]
*http://dbpedia.org/
* [http://dbpedia.org Project site]
*[http://lists.wikimedia.org/pipermail/wikitech-l/2007-January/029061.html dbpedia.org - Querying Wikipedia like a Semantic Database], announcement on wikitech-l


===DBpedia on MetaWiki===
Several pages were created on MetaWiki to aid in extracting information from Wikipedia.
Semi-structured information can be found on Wikipedia in articles with infoboxes.
The main problem is that infoboxes are generally designed to appeal to human users, when rendered.
For machines it is much harder to access the information in infoboxes and extract meaningful and useful data.
To ease this process two different pages were created in the [[User:DBpedia-Bot]] namespace:

====DBpedia Ontology====
The [[User:DBpedia-Bot/ontology| DBpedia Ontology]] aims to be a common scheme for articles (and especially infoboxes of articles) in Wikipedia.
It is a repository of unique names and identifiers (vocabulary) for the strings used in infoboxes and aims to merge together what belongs together and separate strings with different meaning.

A simple example:
On the English Wikipedia's infoboxes there are 27 different variants of [[User:DBpedia-Bot/ontology/birthPlace|birthPlace]] such as birthplace, placeOfBirth, born,
which all have the same meaning, i.e. telling the place where a person was born.
On the other hand, in [[Wikipedia:Bjork]] for example '''born''' is used for [[User:DBpedia-Bot/ontology/birthPlace|birthDate]] and [[User:DBpedia-Bot/ontology/birthPlace|birthPlace]] at the same time.

The [[User:DBpedia-Bot/ontology| DBpedia Ontology]] provides a single name for each meaning and thus makes it easier for machines to extract information correctly.
Besides names for properties it also contains classes, which provide a strict categorisation system for articles.

More about the ontology can be found on the [[User:DBpedia-Bot/ontology| DBpedia Ontology page]] and the [[User:DBpedia-Bot/ontology-overview| Ontology overview page ]].

It adheres to common Semantic Web standards like [[Wikipedia:RDF| RDF]] and [[Wikipedia:Web Ontology Language| OWL]].

====DBpedia Mapping====
As the [[User:DBpedia-Bot/ontology| DBpedia Ontology]] provides a repository for semantically well-defined identifiers, it is now possible to create a [[User:DBpedia-Bot/mapping| mapping]] from infoboxes and templates to the ontology.

Each infobox template (such as [[Wikipedia:Template:Infobox_Musical_artist]]) can be assigned an OWL class from the ontology.
Furthermore, each template parameter (such as Background, Born, Died, Origin) can be mapped to an ontology property.

More information about the can soon be found on the [[User:DBpedia-Bot/mapping| mapping page]].

====Synchronization====

The changes made to subpages of [[User:DBpedia-Bot/mapping]] and [[User:DBpedia-Bot/ontology]]
are synchronized with a Semantic Web triple store and take effect within minutes.

The ontology can be downloaded with this link:
[http://dbpedia-live.openlinksw.com/sparql?default-graph-uri=http%3A%2F%2Fdbpedia.org%2Fmeta&should-sponge=&query=+CONSTRUCT+{%3Fs+%3Fp+%3Fo}+where%0D%0A{%0D%0A%3Fb+%3Chttp%3A%2F%2Fdbpedia.org%2Fmeta%2Forigin%3E+%3Chttp%3A%2F%2Fdbpedia.org%2Fmeta%2FTBoxExtractor%3E+.%0D%0A%3Fb+owl%3AannotatedSource+%3Fs+.%0D%0A%3Fb+owl%3AannotatedProperty+%3Fp+.%0D%0A%3Fb+owl%3AannotatedTarget+%3Fo+.%0D%0AFilter+%28!+%28%3Fp+in+%28%0D%0A%3Chttp%3A%2F%2Fdbpedia.org%2Fmeta%2Feditlink%3E%2C+%0D%0A%3Chttp%3A%2F%2Fdbpedia.org%2Fmeta%2FeditLink%3E%2C+%0D%0A%3Chttp%3A%2F%2Fdbpedia.org%2Fmeta%2Frevisionlink%3E%2C%0D%0A%3Chttp%3A%2F%2Fdbpedia.org%2Fmeta%2FrevisionLink%3E%2C%0D%0A%3Chttp%3A%2F%2Fdbpedia.org%2Fmeta%2Foaiidentifier%3E%29%29%29.%0D%0A}+&format=application%2Frdf%2Bxml&debug=on&timeout= link] and viewed in standard Ontology editors like [[Wikipedia:Protégé_(software)]]


[[Category:DBpedia|*]]
[[Category:German engineering]]
[[Category:German engineering]]
[[Category:Research]]
[[Category:Research]]
[[category:Wikidata]]
[[category:Wikidata]]
[[Category:Wikimedia_projects]]

Latest revision as of 12:36, 27 August 2016

DBpedia is a community effort to extract structured information from Wikipedia and to make this information available on the Web. DBpedia allows you to ask sophisticated queries against Wikipedia and to interlink other datasets on the Web with DBpedia data.

Information about the DBpedia project and dataset can be found here:

DBpedia on MetaWiki[edit]

Several pages were created on MetaWiki to aid in extracting information from Wikipedia. Semi-structured information can be found on Wikipedia in articles with infoboxes. The main problem is that infoboxes are generally designed to appeal to human users, when rendered. For machines it is much harder to access the information in infoboxes and extract meaningful and useful data. To ease this process two different pages were created in the User:DBpedia-Bot namespace:

DBpedia Ontology[edit]

The DBpedia Ontology aims to be a common scheme for articles (and especially infoboxes of articles) in Wikipedia. It is a repository of unique names and identifiers (vocabulary) for the strings used in infoboxes and aims to merge together what belongs together and separate strings with different meaning.

A simple example: On the English Wikipedia's infoboxes there are 27 different variants of birthPlace such as birthplace, placeOfBirth, born, which all have the same meaning, i.e. telling the place where a person was born. On the other hand, in Wikipedia:Bjork for example born is used for birthDate and birthPlace at the same time.

The DBpedia Ontology provides a single name for each meaning and thus makes it easier for machines to extract information correctly. Besides names for properties it also contains classes, which provide a strict categorisation system for articles.

More about the ontology can be found on the DBpedia Ontology page and the Ontology overview page .

It adheres to common Semantic Web standards like RDF and OWL.

DBpedia Mapping[edit]

As the DBpedia Ontology provides a repository for semantically well-defined identifiers, it is now possible to create a mapping from infoboxes and templates to the ontology.

Each infobox template (such as Wikipedia:Template:Infobox_Musical_artist) can be assigned an OWL class from the ontology. Furthermore, each template parameter (such as Background, Born, Died, Origin) can be mapped to an ontology property.

More information about the can soon be found on the mapping page.

Synchronization[edit]

The changes made to subpages of User:DBpedia-Bot/mapping and User:DBpedia-Bot/ontology are synchronized with a Semantic Web triple store and take effect within minutes.

The ontology can be downloaded with this link: link and viewed in standard Ontology editors like Wikipedia:Protégé_(software)