Skip to content

Releases: vmenger/docdeid

v1.0.1

06 May 07:56
13ce865

Choose a tag to compare

1.0.1 (2025-05-06)

Changed

  • updated dependencies

v1.0.0

20 Dec 10:04
f9fb9fb

Choose a tag to compare

1.0.0 (2023-12-20)

Added

  • some internal speedups for SingleTokenLooupAnnotator, MultiTokenLookupAnnotator and LookupTrie
  • caching for sorting annotations, which helps with speed
  • the pre_match_words attribute for RegexpAnnotator
  • the option to provide a LookupTrie to a MultiTokenAnnotator directly
  • a method for getting all words or, for looking up tokens with specific text values in a TokenList, with options for matching_pipeline
  • automated build/publish on merge to main

Changed

  • sorting Annotation and AnnotationSet now requires sort key to be provided as a tuple, and callbacks as a frozendict
  • renamed docdeid.tokenize to docdeid.tokenizer
  • renamed docdeid.process.doc to docdeid.process.doc_processor
  • renamed docdeid.process.annotation_set to docdeid.process.annotation_processor
  • Annotation and Token now only include int/str fields when serializing
  • formatting and linting settings
  • moved the logic for linking tokens to TokenList rather than Tokenizer
  • use casefold() instead of lower() for lowercasing

Fixed

  • a bug with overlapping annotations in MultiTokenLookupAnnotator

Removed

  • automated coverage reporting

v0.1.10

28 Nov 12:48
7b0fb20

Choose a tag to compare

0.1.10 (2023-11-28)

Added

  • RegexpAannotator accepts regexp strings in addition to compiled regexp patterns

Changed

  • consisent use of args and kwargs in Annotator class tree
  • RegexpAnnotator now offers function to validate matches, implementable by subclassing

v0.1.9

20 Oct 20:02
b7b8aa7

Choose a tag to compare

0.1.9 (2023-10-20)

Changed

  • made the priority attribute of an Annotation non-Optional
  • multi token lookup now sets the start_token and end_token fields of an Annotation

Fixed

  • a bug with determnistic sort, when Optional fields were set

v0.1.8

01 Aug 13:29
6ee543c

Choose a tag to compare

0.1.8 (2023-08-01)

Added

  • an additional priority attribute for Annotation, giving an extra option for sorting

v0.1.7

26 Jul 09:11
01aab4d

Choose a tag to compare

0.1.7 (2023-07-26)

Changed

  • upgraded dependencies

v0.1.6

28 Mar 14:39
a3e8b37

Choose a tag to compare

0.1.6 (2023-03-28)

Changed

  • upgraded dependencies, including a markdown-it-py which had a vulnerability

v0.1.5

15 Feb 13:57
71eb3be

Choose a tag to compare

0.1.5 (2023-02-15)

Changed

  • upgraded dependencies, including a certifi which had a vulnerability

v0.1.4

29 Nov 09:48
8590137

Choose a tag to compare

0.1.4 (2022-11-29)

Changed

  • renamed processors_enabled and processors_disabled to enabled and disabled, respectively

v.0.1.3

28 Nov 10:17
ea3b958

Choose a tag to compare

0.1.3 (2022-11-28)

Added

  • Include py.typed in packaging