Releases: vmenger/deduce
v2.4.2
v2.4.1
v2.4.0
v2.3.1
v2.3.0
2.3.0 (2023-10-25)
Added
- lookup lists (and logic) for Dutch provinces, regions, municipalities and streets
Changed
- name of residences annotator to placenames, now includes provinces, regions and municipalities
- lookup lists (and logic) for residences
- logic for streets, housenumber and housenumber letters
v2.2.0
2.2.0 (2023-09-28)
Changed
- tokenizer logic:
  - a token is now a sequence of alphanumeric characters, a single newline, or a single special character
  - whitespaces are no longer considered tokens
- moved token pattern logic to config, using a new TokenPatternAnnotator
- moved context pattern logic to config, using a new ContextAnnotator
- many updates to name detection logic
- lookup list optimizations
- added, removed and simplified patterns
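
The tokenizer rule in this release (a token is a run of alphanumeric characters, a single newline, or a single special character, with other whitespace skipped) can be illustrated with one regular expression. This is a minimal sketch under that reading of the changelog, not deduce's actual tokenizer; `TOKEN_PATTERN` and `tokenize` are hypothetical names.

```python
import re
from typing import List

# Hypothetical pattern, tried left to right at each position:
#   [^\W_]+  a run of alphanumeric characters (letters and digits)
#   \n       a single newline
#   [^\w\s]  a single character that is neither alphanumeric nor whitespace
#   _        an underscore, treated here as a special character
TOKEN_PATTERN = re.compile(r"[^\W_]+|\n|[^\w\s]|_")


def tokenize(text: str) -> List[str]:
    """Split text into tokens; spaces and tabs match nothing and are skipped."""
    return TOKEN_PATTERN.findall(text)
```

For example, `tokenize("Jan de Vries,\n06-12345678")` yields the three name parts, the comma, the newline, and then `06`, `-` and `12345678` as separate tokens.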
v2.1.0
2.1.0 (2023-08-07)
Added
- a component for deidentifying BSN numbers (Dutch citizen service numbers)
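
BSN numbers are nine digits long and are commonly validated with the so-called elfproef (11-test), which helps separate them from arbitrary nine-digit numbers. The sketch below shows that check only. Whether deduce's component applies exactly this test is an assumption, and `passes_elfproef` is a hypothetical name.

```python
def passes_elfproef(bsn: str) -> bool:
    """Return True if a 9-digit string satisfies the BSN 11-test."""
    # Only plain ASCII digits are accepted, so int() below cannot fail.
    if len(bsn) != 9 or not (bsn.isascii() and bsn.isdigit()):
        return False
    # Weights 9, 8, ..., 2 for the first eight digits and -1 for the last.
    weights = [9, 8, 7, 6, 5, 4, 3, 2, -1]
    total = sum(int(digit) * weight for digit, weight in zip(bsn, weights))
    return total % 11 == 0
```

A detector built on this idea would first find nine-digit candidates in the text and tag only those that pass the check.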
Changed
- updated dependencies
- by default, deduce now recognizes and tags BSN numbers
- by default, deduce now recognizes all other 7+ digit numbers as identifiers
- improved regular expressions for e-mail address and url matching, with separate tags
- logic for detecting phone numbers (improvements for hyphens, whitespaces, false positive identifiers)
- improved regular expression for age matching
- date detection logic:
  - now only recognizes combinations of day, month and year (day/month combinations caused many false positives)
  - detects year-month-day format in addition to day-month-year
- loading a custom config now only replaces the config options that are explicitly set, using defaults for those not included in the custom config
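
The custom config behaviour described in the last item above can be pictured as a merge of the custom settings on top of the defaults. The following is a minimal sketch of that idea, not deduce's implementation; `merge_config` is a hypothetical helper, and merging nested sections recursively (rather than only at the top level) is an assumption.

```python
from copy import deepcopy


def merge_config(defaults: dict, custom: dict) -> dict:
    """Return a copy of defaults with the values from custom applied on top."""
    merged = deepcopy(defaults)
    for key, value in custom.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            # Recurse so that sibling options keep their default values.
            merged[key] = merge_config(merged[key], value)
        else:
            # Explicitly set options replace the default outright.
            merged[key] = deepcopy(value)
    return merged
```

With this approach a custom config that sets a single option leaves every other option at its default, and the defaults themselves are never modified.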
Fixed
- annotations can no longer be counted as adjacent when separated by newline or tab (and will thus not be merged)
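
The fix above amounts to an adjacency test that rejects newlines and tabs between two annotations. A minimal sketch follows; the `SLACK` pattern (up to two separator characters) and the `are_adjacent` function are hypothetical and not taken from deduce.

```python
import re

# Hypothetical slack pattern: at most two separator characters, none of which
# is a newline or a tab.
SLACK = re.compile(r"[.\-, ]{0,2}")


def are_adjacent(text: str, left_end: int, right_start: int) -> bool:
    """Return True if only allowed separators lie between two annotation spans."""
    if right_start < left_end:
        return False
    return SLACK.fullmatch(text[left_end:right_start]) is not None
```

In the text `"Jan Jansen\nAmsterdam"`, the spans for `Jan` and `Jansen` are adjacent (one space between them), while `Jansen` and `Amsterdam` are not, so they would not be merged into one annotation.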
Removed
- a separate patient identifier tag, now superseded by a generic tag
- detection of day/month combinations for dates, as this caused many false positives (e.g. lab values, numeric scores)
Deprecated
- backwards compatibility, which was temporarily added to transition from v1 to v2