Skip to content

Issue: Understand the dataset #37

@kyanmahajan

Description

@kyanmahajan

Issue: Understand the Dataset & Perform Initial Exploration

Objective

Before building any model, it is crucial to understand the dataset thoroughly.
This issue focuses on exploring the dataset structure, classes, distributions, and visual patterns in the images.


Dataset

Participants can download the dataset and work locally, or work on Kaggle (Recommended)/colab directly and upload the notebook.


Tasks to Perform

You must include (but are not limited to) the following:

  • Identify all classes and number of samples per class and Check class balance / imbalance [Plot the required plot]
  • Visualize random samples from each class
  • Analyze image resolutions, formats

Submission Guidelines

  1. Fork the repository
  2. Create a folder inside participants/ named exactly as your enrollment number
  3. Add your exploratory notebook(s) inside that folder
  4. Commit and push your changes
  5. Open a Pull Request referencing this issue
  6. Follow the template as in previous issues [ Issue: #<issue_number> ]
participants/
└── <your_enrollment_number>/
    └── data_exploration.ipynb

Metadata

Metadata

Assignees

No one assigned

    Labels

    Points: 30open-for-allAny one can work on it without getting assigned, every PR can merge

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions