Missing rdf:types in HTML view and in some SPARQL answers #820

@jmkeil

Issue validity

Could not check due to a DIEF ERROR 502 response.

Error Description

Many resources are missing some rdf:type statements.

Pinpointing the source of the error

It affects the SPARQL endpoint (http://dbpedia.org/sparql) and the HTML representation (http://dbpedia.org/resource/…). I did not check any other sources.

Details

It seems to affect many different resources. I will illustrate it using resources that represent astronauts.

  • Astronauts should have the rdf:type dbo:Astronaut, but some do not, e.g. https://dbpedia.org/page/Anne_McClain

  • Other resources are not affected, e.g. https://dbpedia.org/page/John_Young_(astronaut)

  • The Wikipedia sources of both affected and unaffected resources contain the Infobox astronaut

  • The Infobox astronaut is properly mapped: https://mappings.dbpedia.org/index.php/Mapping_en:Infobox_astronaut

  • dbo:Astronaut is defined at https://mappings.dbpedia.org/index.php/OntologyClass:Astronaut as an equivalent class of http://www.wikidata.org/entity/Q11631

  • Both affected and unaffected resources have the rdf:type http://www.wikidata.org/entity/Q11631

  • http://dbpedia.org/sparql gives inconsistent answers about the types:

    SELECT * WHERE {
    	<http://dbpedia.org/resource/Anne_McClain> rdf:type ?type
    }
    

    -> dbo:Astronaut not contained, wikidata:Q11631 contained

    DESCRIBE <http://dbpedia.org/resource/Anne_McClain>
    

    -> dbo:Astronaut not contained, wikidata:Q11631 contained

    ASK {
    	<http://dbpedia.org/resource/Anne_McClain> rdf:type dbo:Astronaut .
    }
    

    -> true

    ASK {
    	<http://dbpedia.org/resource/Anne_McClain> rdf:type <http://www.wikidata.org/entity/Q11631> .
    }
    

    -> true

    SELECT ?case (COUNT(DISTINCT ?entity) AS ?COUNT) WHERE {
    	?entity a <http://www.wikidata.org/entity/Q11631> .
    	FILTER NOT EXISTS { ?entity a dbo:Astronaut . }
    	BIND("only WD" AS ?case)
    }
    GROUP BY ?case
    

    -> 0

    SELECT ?case (COUNT(DISTINCT ?entity) AS ?COUNT) WHERE {
    	?entity a dbo:Astronaut .
    	?entity a <http://www.wikidata.org/entity/Q11631> .
    	BIND("DBO and WD" AS ?case)
    }
    GROUP BY ?case
    

    -> 713

    SELECT ?case (COUNT(DISTINCT ?entity) AS ?COUNT) WHERE {
    	?entity a dbo:Astronaut .
    	FILTER NOT EXISTS { ?entity a <http://www.wikidata.org/entity/Q11631> . }
    	BIND("only DBO" AS ?case)
    }
    GROUP BY ?case
    

    -> 26

    SELECT ?case (COUNT(DISTINCT ?entity) AS ?COUNT) WHERE {
    	?entity a dbo:Astronaut .
    	BIND("DBO" AS ?case)
    }
    GROUP BY ?case
    

    -> 739

    SELECT ?case (COUNT(DISTINCT ?entity) AS ?COUNT) WHERE {
    	?entity a <http://www.wikidata.org/entity/Q11631> .
    	BIND("WD" AS ?case)
    }
    GROUP BY ?case
    

    -> 713
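
The three disjoint cases counted above can also be collected in a single query, which makes it easier to confirm that they add up (0 + 713 + 26 = 739 for dbo:Astronaut, and 0 + 713 = 713 for Q11631). This is only a sketch: the `wd:` prefix and the lowercase `?count` variable are chosen here for brevity and are not taken from the queries above.

```sparql
PREFIX dbo: <http://dbpedia.org/ontology/>
PREFIX wd:  <http://www.wikidata.org/entity/>   # prefix name is an assumption

# One row per non-empty case; a case with no matching entity yields no row at all.
SELECT ?case (COUNT(DISTINCT ?entity) AS ?count) WHERE {
  {
    # typed with the Wikidata class only
    ?entity a wd:Q11631 .
    FILTER NOT EXISTS { ?entity a dbo:Astronaut . }
    BIND("only WD" AS ?case)
  } UNION {
    # typed with both classes
    ?entity a dbo:Astronaut .
    ?entity a wd:Q11631 .
    BIND("DBO and WD" AS ?case)
  } UNION {
    # typed with the DBpedia ontology class only
    ?entity a dbo:Astronaut .
    FILTER NOT EXISTS { ?entity a wd:Q11631 . }
    BIND("only DBO" AS ?case)
  }
}
GROUP BY ?case
```

Note that the aggregate counts are consistent with each other. The inconsistency is between the pattern-matching queries (ASK, aggregates), which see dbo:Astronaut on Anne_McClain, and the enumerating queries (SELECT ?type, DESCRIBE), which do not list it.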

As the HTML rendering is consistent with the DESCRIBE query and the non-aggregating SELECT query, this is likely a SPARQL endpoint error. However, an extraction error cannot be ruled out completely.
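
One way to narrow this down further might be to list the types together with the named graph that holds each statement. The following is a minimal sketch. It assumes that the endpoint stores the type statements in named graphs that can be matched with `GRAPH ?g`, which I have not verified.

```sparql
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>

# List every rdf:type statement for the example resource,
# together with the named graph it is stored in (assumption: named graphs are exposed).
SELECT ?g ?type WHERE {
  GRAPH ?g {
    <http://dbpedia.org/resource/Anne_McClain> rdf:type ?type .
  }
}
ORDER BY ?g ?type
```

If dbo:Astronaut is missing from this listing as well, while the ASK query still returns true, that would suggest the two query forms are evaluated differently by the endpoint rather than the triple being absent from the extracted data.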
