feat: add vibecoded but working convert_json_jsonld.py #23
base: develop
This PR is being reviewed by Cursor Bugbot
Hi @rmfranken, I can see the conversion for
Here are the conversion commands for each type; they are pretty much the same: An organization: The script auto-detects what kind of entity it is dealing with and processes it accordingly. The base-url is maybe a bit too safe, but we can see how easy it is to inject it into the call that we build.
I think I want to make the URIs hashed instead of using blank nodes. Tentris is not handling blank nodes very well. I am not sure why yet and will investigate tomorrow.
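One way the hashed-URI idea could look, as a minimal sketch only: derive a stable IRI from an entity's type and a natural key, so the same entity always gets the same identifier instead of a fresh blank node. The base URL, the function name `hashed_iri`, and the choice of SHA-256 truncated to 16 hex characters are all assumptions for illustration, not what the script in this PR does.

```python
import hashlib

# Hypothetical base URL; the real script takes its own base-url argument.
BASE_URL = "https://example.org/entity/"


def hashed_iri(entity_type: str, key: str, base_url: str = BASE_URL) -> str:
    """Return a deterministic IRI for an entity instead of a blank node.

    The same (entity_type, key) pair always maps to the same IRI, so
    repeated conversions refer to one node in the triple store.
    """
    # Hash the type together with the key so that, for example, a person
    # and an organization with the same name do not collide.
    digest = hashlib.sha256(f"{entity_type}:{key}".encode("utf-8")).hexdigest()
    # Truncation keeps IRIs short; 16 hex characters is an arbitrary choice.
    return f"{base_url}{entity_type}/{digest[:16]}"


if __name__ == "__main__":
    print(hashed_iri("organization", "Example Org"))
    print(hashed_iri("person", "Example Org"))
```

Because the IRI depends only on its inputs, re-running a conversion on the same data yields the same identifiers, which blank nodes do not guarantee.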
…ion.py pydantic stuff so we assign good author IRI's
        org.linkedEntities = enrichment_data.organization_relations[org.legalName]
    else:
        org.academicCatalogRelations = []
        org.linkedEntities = []
Bug: Data Identity Crisis: Authors Become Organizations
The organization-level enrichment loop iterates over self.data.author instead of the correct organizations field. This causes the code to process authors as if they were organizations, attempting to access org.legalName on Person objects which don't have that attribute. The loop should iterate over self.data.relatedToOrganizations instead.
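The fix described above can be sketched as follows. This is not the PR's actual code: the dataclasses stand in for the project's pydantic models, `enrich_organizations` is a hypothetical function name, and only the field names quoted in the review (`author`, `relatedToOrganizations`, `legalName`, `linkedEntities`, `academicCatalogRelations`, `organization_relations`) come from the source.

```python
from dataclasses import dataclass, field


# Stand-ins for the project's models (assumed shapes, for illustration only).
@dataclass
class Person:
    name: str  # note: no legalName attribute


@dataclass
class Organization:
    legalName: str
    academicCatalogRelations: list = field(default_factory=list)
    linkedEntities: list = field(default_factory=list)


@dataclass
class EnrichmentData:
    organization_relations: dict = field(default_factory=dict)


@dataclass
class RepositoryData:
    author: list = field(default_factory=list)
    relatedToOrganizations: list = field(default_factory=list)


def enrich_organizations(data: RepositoryData, enrichment_data: EnrichmentData) -> RepositoryData:
    """Attach organization-level relations to organizations, not authors."""
    # Iterate over relatedToOrganizations; iterating over data.author here
    # would hit Person objects, which have no legalName.
    for org in data.relatedToOrganizations:
        if org.legalName in enrichment_data.organization_relations:
            org.linkedEntities = enrichment_data.organization_relations[org.legalName]
        else:
            org.academicCatalogRelations = []
            org.linkedEntities = []
    return data


if __name__ == "__main__":
    data = RepositoryData(
        author=[Person("Example Author")],
        relatedToOrganizations=[Organization("Example Org"), Organization("Other Org")],
    )
    enrichment = EnrichmentData({"Example Org": ["related-entity"]})
    enrich_organizations(data, enrichment)
    print(data.relatedToOrganizations)
```

The authors list is left untouched, so the author-level enrichment can stay in its own loop.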
| org.linkedEntities = enrichment_data.organization_relations[org.legalName] | ||
| else: | ||
| org.academicCatalogRelations = [] | ||
| org.linkedEntities = [] |
Bug: Enrichment Loop Confuses Authors and Organizations
The organization-level enrichment loop iterates over self.data.author instead of the correct organizations field. This causes the code to process authors as if they were organizations, attempting to access org.legalName on Person objects which don't have that attribute. The loop should iterate over self.data.relatedToOrganizations instead.
Note
Add JSON→JSON-LD conversion and a full agents-based enrichment pipeline with supporting data models, API/analysis modules, utilities, tooling, and docs.
- scripts/convert_json_jsonld.py and docs for JSON→JSON-LD conversion, plus Tentris upload scripts (scripts/upload_all_to_tentris.sh, scripts/test_tentris_upload.sh).
- src/agents/**), including structured outputs, classifiers, context compilers, prompts, and validation.
- src/data_models/**) with conversion utilities.
- src/api.py and analysis modules for users, orgs, repositories (src/analysis/**).
- src/context/**).
- src/llm/model_config.py and agent management setup.
- src/utils/**).
- justfile, pyproject.toml, and extensive docs under docs/ and .cursor/rules/.
- tests/test_cache.py.
Written by Cursor Bugbot for commit 0df57dc.