This application allows users to perform genomic sequence clustering using k-means clustering and association rule mining. You can input genomic sequences, extract codons, apply clustering algorithms, and visualize the results.
- Clustering: Cluster genomic sequences based on codon patterns using k-means clustering.
- Visualization: Visualize clusters and association rules using various plotting techniques.
- Interactive Interface: Easily interact with the app through user-friendly controls.
- Input a Genomic Sequence: Enter a genomic sequence (a string of nucleotides) to start the process.
- Choose Clustering Parameters: Select the number of clusters you want to create (e.g., 3 clusters).
- View Results: The app will display the clustered sequences and any association rules found.
You can use the sample genomic sequences file named 2021_sequences.csv
which is included in the folder for testing purposes. This file contains sequences that you can upload to the app for clustering and association rule mining.
You can access the app at the following URL:
Genomic Sequence Clustering and Association Rule Mining App
Feel free to explore the features and interact with the app to analyze genomic data.
- Streamlit: For building the web application.
- Scikit-learn: For machine learning and clustering.
- Mlxtend: For association rule mining.
- Matplotlib: For data visualization.
- Pandas: For handling data.
Note: This app is built as part of a bioinformatics analysis tool. If you have any feedback or suggestions, feel free to reach out.