Skip to main content
Skip table of contents

PII detection (AI-driven)

Features

GermainUX automatically detects PII in the data that it collects in real-time and relies on BERT-based AI models for that purpose.

Our findings on other models:

ModelName

Based on

License

Params

Entity Names for person/address

Impression

Notes

Evaluated formats

1-13-am/deberta-pii-finetuned

mit

184M

NAME_STUDENT, STREET_ADDRESS

good

aggregation_strategy='first' Whitespace not preserved

ctrlbuzz/bert-addresses

bert-base-cased

?

PER, addr

good

Only US addresses

text, JSON

h2oai/deberta_finetuned_pii

mit

FULLNAME, FIRSTNAME, MIDDLENAME, LASTNAME, STREETADDRESS

good

aggregation_strategy='first'

text, JSON

jammmmmm/pii

lakshyakh93/deberta_finetuned_pii

mit

?

FULLNAME, FIRSTNAME, LASTNAME, STREETADDRESS, USERNAME

good

aggregation_strategy='first'. Attempts to detect username/password

text

lakshyakh93/deberta_finetuned_pii

mit

?

FULLNAME, FIRSTNAME, LASTNAME, STREETADDRESS

good

aggregation_strategy='first'

kulkarni-harsh/address-extraction-ner

distilbert/distilbert-base-cased

mit

65M

LABEL_1+LABEL_2

limited

Only addresses, not persons

yonigo/distilbert-base-multilingual-cased-pii

apache-2.0

135M

BUILDING

limited

Only addresses, not persons

zmilczarek/pii-detection-roberta-v2

124M

NAME_STUDENT, ID_NUM

limited

aggregation_strategy='first' Only the street number and zipcode of an address

DioulaD/birdi-finetuned-ner-address-v2

camembert/camembert-base

poor

hewonty/longformer-ner-finetuned-pii

apache-2.0

148M

NAME_STUDENT

poor

Service: Analytics

Feature Availability: 2024.3

JavaScript errors detected

Please note, these errors can depend on your browser setup.

If this problem persists, please contact our support.