Web Reconnaissance & Downloader CLI

A robust Python command-line tool designed to crawl websites, detect downloadable files based on user-defined extensions, and download them with an interactive interface.

🚀 Features

Recursive Crawling: Scans entire website sections starting from a base URL, respecting user-defined depth and page limits.
Single Page Analysis: Analyzes a specific URL to extract immediate download links.
Auto-Dependency Management: Automatically checks and installs missing Python libraries upon launch.
Smart File Detection:
- Detects extensions in the URL path.
- Analyzes query parameters (e.g., ?file=report.pdf).
- Performs HEAD requests to inspect Content-Type and Content-Disposition headers for hidden files.
Stealthy: Uses randomized User-Agent strings to mimic real browsers.
Interactive UI: Numbered CLI menu to select files for download.
Resilient Downloading: Supports visual progress bars (tqdm) and handles file naming automatically.

📋 Prerequisites

Python 3.6 or higher.
An active internet connection.

The script will attempt to automatically install the following libraries if they are missing:

requests
beautifulsoup4
tqdm
fake_useragent

⚙️ Installation & Setup

Clone or Download the script to your local machine.

git clone https://github.com/Punkcake21/CliDownloader

cd CliDownloader

Run the script:
```
python CliDownloader.py
```
(OPTIONAL) Modify the extensions.txt file to your liking

💻 Usage

Upon running the script, follow the interactive prompts:

1. Select Operation Mode

You will be asked: [*] start crawler? (y/n) or 'q' to quit:

Crawler Mode (y):
- Enter the Start URL.
- Set Max Depth (e.g., 2 allows the crawler to follow links found on the first page).
- Set Max Pages to limit the scope.
- The tool will traverse the domain and aggregate all found files.
Single Page Mode (n):
- Enter the specific URL you want to analyze.
- The tool will only look for files linked on that specific page.

2. Logging Level

Select how much detail you want to see in the console:

q (Quiet): Only errors and critical info.
n (Normal): Standard progress updates (Default).
v (Verbose): Detailed debugging info (HTTP connections, parsing details).

3. Download Menu

Once scanning is complete, a list of files will appear:

======================================================================
         DOWNLOADS FOUND ON WEB PAGE
======================================================================
     [No.] | FILE NAME
----------------------------------------------------------------------
    [ 1] | annual_report_2024.pdf
    [ 2] | dataset_backup.zip
----------------------------------------------------------------------

Enter file No. to download, 'r' to refresh list, 'n' for new URL, 'q' to quit:

Enter a number (e.g., 1) to download that specific file.
Enter r to refresh the list.
Enter n to enter a new URL.
Enter q to quit the program.

📂 Project Structure

.
├── CliDownloader.py        # Main application script
├── extensions.txt          # CONFIG FILE
└── /Downloads              # Files are saved in this folder

⚠️ Disclaimer

This tool is intended for educational purposes and legitimate reconnaissance. The author is not responsible for any misuse or violation of terms of service of third-party websites. Always ensure you have permission before crawling or downloading data from external domains.

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
.gitignore		.gitignore
CliDownloader.py		CliDownloader.py
LICENSE		LICENSE
README.md		README.md
extensions.txt		extensions.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Web Reconnaissance & Downloader CLI

🚀 Features

📋 Prerequisites

⚙️ Installation & Setup

💻 Usage

1. Select Operation Mode

2. Logging Level

3. Download Menu

📂 Project Structure

⚠️ Disclaimer

About

Uh oh!

Releases 1

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Web Reconnaissance & Downloader CLI

🚀 Features

📋 Prerequisites

⚙️ Installation & Setup

💻 Usage

1. Select Operation Mode

2. Logging Level

3. Download Menu

📂 Project Structure

⚠️ Disclaimer

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages