This is a project that combines a Java Spring Boot web application and a Python-based web scraper for data analysis.
- Java Spring Boot: NLP analysis, web application features.
- Python Web Scraping: Scrapes health-related websites for data analysis.
- Website: MedlinePlus Encyclopedia
- Install dependencies using Maven
- Install from
pom.xmlusingmvn clean install. - Run the Spring Boot application using
mvn spring-boot:run. - Access the local webpage using
http://localhost:8080/input.
- Install dependencies:
pip install -r requirements.txt. - Run the web scraper:
python web.py- raw data collected onscraped_data.xlsx - Data Analysis on Jupyter Notebook:
stat.ipynb- cleaned data collected oncleaned_data.xlsx
- Java 11+
- Maven
- Python 3.7+