Script Meta-Fields Extractor

    __  ___     __           _______      __    __        ______     __                  __            
   /  |/  /__  / /_____ _   / ____(_)__  / /___/ /____   / ____/  __/ /__________ ______/ /_____  _____
  / /|_/ / _ \/ __/ __ `/  / /_  / / _ \/ / __  / ___/  / __/ | |/_/ __/ ___/ __ `/ ___/ __/ __ \/ ___/
 / /  / /  __/ /_/ /_/ /  / __/ / /  __/ / /_/ (__  )  / /____>  </ /_/ /  / /_/ / /__/ /_/ /_/ / /    
/_/  /_/\___/\__/\__,_/  /_/   /_/\___/_/\__,_/____/  /_____/_/|_|\__/_/   \__,_/\___/\__/\____/_/

Description

The Script Meta-Fields Extractor is a Python-based tool designed to extract metadata (field names, data types, and example values) from a variety of file formats, including:

CSV
Excel (.xls, .xlsx)
JSON
XML
Parquet
QVD (with the version 1.1)

The tool processes data files located in the inputs directory and generates metadata reports saved in the outputs directory.
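
To make the idea of "field names, data types, and example values" concrete, here is a minimal sketch for the CSV case only, using the standard library. It is not the code in data_info_extractor.py: the names `infer_type` and `extract_csv_metadata` are hypothetical, and the type-guessing heuristic (try int, then float, else str, based on the first data row) is an assumption made for illustration.

```python
import csv
from pathlib import Path


def infer_type(value):
    """Guess a simple type name for one CSV cell (illustrative heuristic only)."""
    if value is None or value == "":
        return "unknown"
    for caster, name in ((int, "int"), (float, "float")):
        try:
            caster(value)
            return name
        except ValueError:
            continue
    return "str"


def extract_csv_metadata(path):
    """Return one dict per field: its name, inferred type, and an example value.

    Assumption: the first line is a header and the first data row is
    representative enough to serve as the example.
    """
    with Path(path).open(newline="", encoding="utf-8") as handle:
        reader = csv.DictReader(handle)
        first_row = next(reader, None)
        fields = reader.fieldnames or []
    if first_row is None:
        # Header only (or empty file): names are known, types are not.
        return [{"field": f, "type": "unknown", "example": None} for f in fields]
    return [
        {"field": f, "type": infer_type(first_row[f]), "example": first_row[f]}
        for f in fields
    ]
```

Calling `extract_csv_metadata("inputs/sample.csv")` would return a list with one entry per column. Inspecting only the first row keeps the sketch short; sampling more rows would give more reliable types.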

Getting Started

Prerequisites

Ensure you have the following installed:

Python 3.6+
pip (Python package manager)

Installation

Clone this repository:

git clone <repository_url>
cd script-meta-fields-extractor

Install the required Python libraries:
```
pip install -r requirements.txt
```
Set up the folder structure:
- Ensure the inputs folder exists and place your data files inside it.
- The script will automatically create the outputs folder if it doesn't exist.
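
The folder setup described above can be sketched with `pathlib`. This is an illustration under assumptions, not the script's actual code: `prepare_folders` is a hypothetical helper, and raising `FileNotFoundError` when inputs is missing is one possible way to handle that case.

```python
from pathlib import Path


def prepare_folders(base_dir="."):
    """Check that inputs/ exists and create outputs/ if needed.

    Hypothetical helper: the folder names come from this README,
    the error handling is an assumption.
    """
    base = Path(base_dir)
    inputs = base / "inputs"
    outputs = base / "outputs"
    if not inputs.is_dir():
        raise FileNotFoundError(f"Missing inputs folder: {inputs}")
    # exist_ok=True makes repeated runs safe when outputs/ is already there.
    outputs.mkdir(parents=True, exist_ok=True)
    return inputs, outputs
```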

Usage

Place your data files (e.g., sample.csv, data.json) in the inputs folder.
Run the script:
```
python data_info_extractor.py
```
Follow the prompts to select a file for analysis.
The tool will display the metadata (field names, types, examples) in the terminal and save the output to the outputs directory.
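
For readers curious how a file-selection prompt like this might work, below is a rough sketch. The names `SUPPORTED`, `list_input_files`, `choose_file`, and `main` are hypothetical, the extension list is inferred from the formats named in the Description, and the numbered-menu behavior is an assumption; the real prompts in data_info_extractor.py may differ.

```python
from pathlib import Path

# Assumed extensions, derived from the formats listed in the Description.
SUPPORTED = {".csv", ".xls", ".xlsx", ".json", ".xml", ".parquet", ".qvd"}


def list_input_files(inputs_dir="inputs"):
    """Return supported data files in the inputs folder, sorted by path."""
    return sorted(
        p for p in Path(inputs_dir).iterdir()
        if p.is_file() and p.suffix.lower() in SUPPORTED
    )


def choose_file(files, answer):
    """Map a 1-based menu answer (as typed by the user) to a file, or None."""
    try:
        index = int(answer) - 1
    except ValueError:
        return None
    return files[index] if 0 <= index < len(files) else None


def main():
    """Print a numbered menu and read the user's choice from the terminal."""
    files = list_input_files()
    for number, path in enumerate(files, start=1):
        print(f"{number}. {path.name}")
    selected = choose_file(files, input("Select a file number: "))
    print(selected if selected else "Invalid selection")
```

Keeping `choose_file` separate from the `input()` call makes the selection logic easy to check without a terminal.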

Project Structure

SCRIPT-META-FIELDS-EXTRACTOR/
├── inputs/                 # Input folder containing data files (CSV, JSON, etc.)
│   ├── sample.csv
│   ├── sample.json
│   ├── sample.parquet
│   ├── sample.xls
│   └── sample.xml
├── outputs/                # Output folder for processed metadata reports
│   └── .gitignore          # Ignores unnecessary files
├── data_info_extractor.py  # Main Python script for metadata extraction
├── LICENSE                 # License information
├── README.md               # Project documentation
└── requirements.txt        # Python dependencies

Contributing

We welcome contributions to improve this project! To contribute:

Fork the repository.
Create a new branch (feature/your-feature-name).
Commit your changes with clear and concise messages.
Push to your branch and open a pull request.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Acknowledgements

OpenAI's o1 for assisting with the refactoring and creating this README.
