Skip to content

HiLab-git/VLM-CPL

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

28 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

VLM-CPL

Overall Framework

There are three steps to train the classification network with selected high-quality pseudo labels. overall

Data prepare

Download the HPH dataset here
Download the LC25K dataset here
Download the CRC100K dataset here
Download the DigestPath dataset here

Using a 4:1 split for training and testing.

Training process

First, use the on-the-shelf VLM for zero-shot inference with our proposed method to filter out noisy samples on the training set.
In the vlm_cpl_LC25K.py file, there are two main functions, MVCandPrompt_feature_consensus.
You can use the combination of MVCandPrompt_feature_consensus or either one alone. You can also adjust the order of these two filters.

python vlm_cpl_LC25K.py --gpu 0

Second, after obtaining high-quality pseudo-labels, you can train a classification network.

python train_pseudo.py --gpu 0 --pseudo_csv <your_csv>

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages