Skip to content

[Feature Request] Add a small "document-to-KG conversion failure modes" note (docs only) #43

@onestardao

Description

@onestardao

Hi AutoSchemaKG team,

Thank you for releasing AutoSchemaKG. Automatically converting documents into schema and knowledge graphs is a powerful upstream component for many RAG systems.

I have been working on failure-mode maps for RAG and data pipelines and recently contributed a robustness-related entry to Harvard MIMS Lab’s ToolUniverse. In document-to-KG workflows, I often see recurring problems:

  • important entities or relations are dropped during conversion
  • schema choices introduce hidden biases in what can be represented
  • downstream RAG systems fail in ways that are hard to trace back to KG conversion errors

I would like to propose a small, documentation-only failure-mode note for AutoSchemaKG.

Proposed feature

Add a short markdown page named for example:

doc_to_kg_conversion_failure_modes.md

The page could:

  1. List typical failure modes when converting documents to KGs:
    • missing entities or relations
    • incorrect grounding
    • schema gaps.
  2. For each, describe:
    • symptoms users might see
    • likely causes (parsing, mapping, schema design)
    • basic diagnostics or sanity checks.
  3. Provide a short checklist for users who plan to feed the resulting KG into RAG systems.

Motivation

  • AutoSchemaKG is an upstream component for retrieval and reasoning systems.
  • A small failure-mode note would help users interpret downstream errors and design better schemas and checks.
  • This is a docs-only change and can be gradually refined.

If this seems useful, I would be happy to draft a concise version in a PR.

Thank you for considering.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions