Skip to content

This is Repo is related to Bioinformatic health science projects

Notifications You must be signed in to change notification settings

Aqsa48/Bioinformatic

Repository files navigation

Genomic Sequence Clustering and Association Rule Mining

This application allows users to perform genomic sequence clustering using k-means clustering and association rule mining. You can input genomic sequences, extract codons, apply clustering algorithms, and visualize the results.

Features:

  • Clustering: Cluster genomic sequences based on codon patterns using k-means clustering.
  • Visualization: Visualize clusters and association rules using various plotting techniques.
  • Interactive Interface: Easily interact with the app through user-friendly controls.

How to Use:

  1. Input a Genomic Sequence: Enter a genomic sequence (a string of nucleotides) to start the process.
  2. Choose Clustering Parameters: Select the number of clusters you want to create (e.g., 3 clusters).
  3. View Results: The app will display the clustered sequences and any association rules found.

Sample Dataset:

You can use the sample genomic sequences file named 2021_sequences.csv which is included in the folder for testing purposes. This file contains sequences that you can upload to the app for clustering and association rule mining.

Application Access:

You can access the app at the following URL:

Genomic Sequence Clustering and Association Rule Mining App

Feel free to explore the features and interact with the app to analyze genomic data.

Technologies Used:

  • Streamlit: For building the web application.
  • Scikit-learn: For machine learning and clustering.
  • Mlxtend: For association rule mining.
  • Matplotlib: For data visualization.
  • Pandas: For handling data.

Note: This app is built as part of a bioinformatics analysis tool. If you have any feedback or suggestions, feel free to reach out.

About

This is Repo is related to Bioinformatic health science projects

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages