Skip to content

Latest commit

 

History

History
31 lines (26 loc) · 1.39 KB

File metadata and controls

31 lines (26 loc) · 1.39 KB

Python-Keyword-Extractor

This is a Python-based web scraping tool for keyword extraction and content analysis. The script automates Google search, scrapes relevant webpage text, identifies top keywords, and exports results to CSV. Ideal for SEO research, content analysis, and competitive insights. User-friendly and customizable.

Keyword Extraction and Web Scraping Project

Overview

This project automates the extraction of keywords and relevant text from webpages using web scraping techniques. It is designed for tasks like content analysis, SEO research, and competitive analysis.

Features

  • Automated Search: Uses Google search to find relevant URLs for specific keywords.
  • Content Scraping: Extracts visible text from headers, descriptions, and main content of webpages.
  • Keyword Analysis: Identifies the most significant keywords while excluding common stopwords.
  • CSV Export: Outputs results in a structured CSV format for easy analysis.

How It Works

  1. Specify your search keywords in the script.
  2. Define domains to exclude (e.g., Amazon, eBay).
  3. Run the script to perform web scraping and text processing.
  4. Review the extracted data in the generated CSV file.

Requirements

  • Python 3.x
  • Libraries:
    • requests
    • BeautifulSoup4
    • nltk
    • tldextract
    • csv

Install dependencies using:

pip install requests beautifulsoup4 nltk tldextract