UniRef #4

@NZ99

The UniProt Reference Clusters (UniRef) are widely used for protein language model pre-training.

These include:

  • UniRef100, composed of all UniProtKB records plus UniParc records not present in UniProt, available at this link.
  • UniRef90, obtained by clustering UniRef100 sequences at 90% sequence identity, available at this link.
  • UniRef50, obtained by clustering UniRef90 seed sequences at 50% sequence identity, available at this link.

The .fasta files need to be processed (?) and made easily available.
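
As one possible reading of "processed", here is a minimal sketch of streaming a UniRef FASTA file into (header, sequence) records using only the Python standard library. The function names (`read_fasta`, `open_fasta`, `cluster_id`) and the file name in the usage note are hypothetical, and the sketch assumes the cluster identifier is the first whitespace-delimited token of each header line (for example `UniRef50_...`); it is not an agreed-upon pipeline for this issue.

```python
import gzip
from typing import Iterator, TextIO, Tuple


def read_fasta(handle: TextIO) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from an open FASTA text stream.

    Records are yielded one at a time, so the whole file never has to
    fit in memory.
    """
    header = None
    chunks = []
    for line in handle:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                yield header, "".join(chunks)
            header = line[1:]
            chunks = []
        elif header is not None:
            # Sequence lines are wrapped; collect and join them later.
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def open_fasta(path: str) -> TextIO:
    """Open a plain or gzip-compressed FASTA file in text mode."""
    if path.endswith(".gz"):
        return gzip.open(path, "rt", encoding="utf-8")
    return open(path, "r", encoding="utf-8")


def cluster_id(header: str) -> str:
    """Return the first token of a header (assumed to be the cluster ID)."""
    return header.split(maxsplit=1)[0]
```

A hypothetical usage, assuming a local file named `uniref50.fasta.gz`, would be `with open_fasta("uniref50.fasta.gz") as fh: for header, seq in read_fasta(fh): ...`. Streaming matters here because the uncompressed UniRef files are very large, so any conversion to a training-friendly format would likely need to work record by record.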
