Skip to content

TowCenter/llm-text-embeddings

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

31 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

LLM Text Embeddings

Open in GitHub Codespaces

Use this repo to generate text embeddings, allowing for interactive mapping to understand similarity across text.

Using the NICAR 2026 schedule, nicar-demo.ipynb generates numeric representations of text and plots them on a 2D chart. Users can also customize notebook to fit their own data's structure.

Setup

Add OpenAI API key to .env file.

OPENAI_API_KEY=""

Running the notebook

Open nicar-demo.ipynb. For the demo, no changes should be needed.

If you have your own data in CSV format, use semantic-map.ipynb update the configuration cell at the top of the notebook with your file path and column names. The notebook loads a CSV by default (Option A). If your data is a folder of .txt files instead, comment out Option A and uncomment Option B in the data loading cell.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Jupyter Notebook 94.7%
  • Dockerfile 5.3%