A powerful research trend discovery tool that fetches and clusters AI topics from arXiv, GitHub, Reddit, and Exa. Built using Python, NLP, and Streamlit.
- Unified search across 4 AI research platforms
- KMeans clustering to group topics
- Keyword extraction using KeyBERT
- Ranking based on source priority, relevance, and recency
- Google-style UI built with Streamlit
- Secure API key management using
.env
- Python 3.10+
- Streamlit
- scikit-learn (TF-IDF, KMeans)
- KeyBERT for keyword extraction
- PRAW (Reddit API), requests, dotenv
- User enters a research query.
- API calls made to arXiv, GitHub, Reddit, Exa.
- Fetched text is cleaned and vectorized using TF-IDF.
- KMeans clustering groups similar content.
- KeyBERT extracts keywords to name clusters.
- Results ranked by source, relevance, and recency.
- Top 5 clusters with 10 trends each are displayed in Streamlit UI.
git clone https://github.com/amith-exe/Trendi-Ai.git
cd Trendi-Ai