Skip to content

Locality Sensitive Hashing #71

@jpcompartir

Description

@jpcompartir
  • Workflow (functions, tests, docs) for using LSH & Jaccard Sim for near duplicate identification and removal
  • Update the near_duplicates.Rmd vignette
  • Benchmark wtih spam_grams

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions