Skip to content
  • Categories
  • Recent
  • Tags
  • Popular
  • World
  • Users
  • Groups
Skins
  • Light
  • Brite
  • Cerulean
  • Cosmo
  • Flatly
  • Journal
  • Litera
  • Lumen
  • Lux
  • Materia
  • Minty
  • Morph
  • Pulse
  • Sandstone
  • Simplex
  • Sketchy
  • Spacelab
  • United
  • Yeti
  • Zephyr
  • Dark
  • Cyborg
  • Darkly
  • Quartz
  • Slate
  • Solar
  • Superhero
  • Vapor

  • Default (Cyborg)
  • No Skin
Collapse
Brand Logo

CIRCLE WITH A DOT

  1. Home
  2. Uncategorized
  3. #OpenAI releases #PrivacyFilter β€” an open-weight #AI model for detecting & redacting #PII in text.

#OpenAI releases #PrivacyFilter β€” an open-weight #AI model for detecting & redacting #PII in text.

Scheduled Pinned Locked Moved Uncategorized
openaiprivacyfilterpiiopensource
5 Posts 2 Posters 0 Views
  • Oldest to Newest
  • Newest to Oldest
  • Most Votes
Reply
  • Reply as topic
Log in to reply
This topic has been deleted. Only users with topic management privileges can see it.
  • michabbb@social.vivaldi.netM This user is from outside of this forum
    michabbb@social.vivaldi.netM This user is from outside of this forum
    michabbb@social.vivaldi.net
    wrote last edited by
    #1

    #OpenAI releases #PrivacyFilter β€” an open-weight #AI model for detecting & redacting #PII in text. Runs fully locally, no data ever leaves your machine. Apache 2.0 licensed. #opensource

    πŸ§΅πŸ‘‡#privacy

    πŸ” Detects 8 PII categories in a single forward pass: names, email addresses, phone numbers, physical addresses, URLs, dates, account numbers & secrets (passwords, API keys) β€” covering virtually all common sensitive data types

    Link Preview Image
    michabbb@social.vivaldi.netM newsgroup@social.vir.groupN 2 Replies Last reply
    0
    • michabbb@social.vivaldi.netM michabbb@social.vivaldi.net

      #OpenAI releases #PrivacyFilter β€” an open-weight #AI model for detecting & redacting #PII in text. Runs fully locally, no data ever leaves your machine. Apache 2.0 licensed. #opensource

      πŸ§΅πŸ‘‡#privacy

      πŸ” Detects 8 PII categories in a single forward pass: names, email addresses, phone numbers, physical addresses, URLs, dates, account numbers & secrets (passwords, API keys) β€” covering virtually all common sensitive data types

      Link Preview Image
      michabbb@social.vivaldi.netM This user is from outside of this forum
      michabbb@social.vivaldi.netM This user is from outside of this forum
      michabbb@social.vivaldi.net
      wrote last edited by
      #2

      🧠 Bidirectional token-classification β€” unlike autoregressive LLMs, #PrivacyFilter reads input from both directions simultaneously for deeper context awareness, catching subtle #PII that simple pattern-matching or RegEx rules miss

      ⚑ 1.5B parameter model with only ~50M active parameters (#MoE) β€” lightweight enough to run on a standard laptop or in a browser, yet achieves ~96–97% F1 score on standard #PII benchmarks #MachineLearning #AI

      michabbb@social.vivaldi.netM 1 Reply Last reply
      0
      • michabbb@social.vivaldi.netM michabbb@social.vivaldi.net

        🧠 Bidirectional token-classification β€” unlike autoregressive LLMs, #PrivacyFilter reads input from both directions simultaneously for deeper context awareness, catching subtle #PII that simple pattern-matching or RegEx rules miss

        ⚑ 1.5B parameter model with only ~50M active parameters (#MoE) β€” lightweight enough to run on a standard laptop or in a browser, yet achieves ~96–97% F1 score on standard #PII benchmarks #MachineLearning #AI

        michabbb@social.vivaldi.netM This user is from outside of this forum
        michabbb@social.vivaldi.netM This user is from outside of this forum
        michabbb@social.vivaldi.net
        wrote last edited by
        #3

        πŸ“ 128,000-token context window β€” processes entire legal documents, long email threads or large codebases in a single pass. No need to chunk text before filtering. #privacy #DataEngineering

        πŸ› οΈ Built for high-throughput workflows: CLI tool (opf), GPU & CPU support, interactive mode, structured JSON output with ANSI color-coded previews. Runs on-premises β€” data never sent to external servers #DevOps

        michabbb@social.vivaldi.netM 1 Reply Last reply
        0
        • michabbb@social.vivaldi.netM michabbb@social.vivaldi.net

          πŸ“ 128,000-token context window β€” processes entire legal documents, long email threads or large codebases in a single pass. No need to chunk text before filtering. #privacy #DataEngineering

          πŸ› οΈ Built for high-throughput workflows: CLI tool (opf), GPU & CPU support, interactive mode, structured JSON output with ANSI color-coded previews. Runs on-premises β€” data never sent to external servers #DevOps

          michabbb@social.vivaldi.netM This user is from outside of this forum
          michabbb@social.vivaldi.netM This user is from outside of this forum
          michabbb@social.vivaldi.net
          wrote last edited by
          #4

          πŸ”§ Fine-tunable on domain-specific data β€” adapts to medical, legal or enterprise environments where generic rules fail. Based on the open #gptoss model family. Available on #HuggingFace under Apache 2.0

          🚨 Caveat: #PrivacyFilter is a redaction & data minimization aid β€” NOT a compliance guarantee. It should be one layer in a holistic #privacybydesign approach. Always combine with human review for high-stakes use cases
          https://openai.com/index/introducing-openai-privacy-filter/

          1 Reply Last reply
          0
          • michabbb@social.vivaldi.netM michabbb@social.vivaldi.net

            #OpenAI releases #PrivacyFilter β€” an open-weight #AI model for detecting & redacting #PII in text. Runs fully locally, no data ever leaves your machine. Apache 2.0 licensed. #opensource

            πŸ§΅πŸ‘‡#privacy

            πŸ” Detects 8 PII categories in a single forward pass: names, email addresses, phone numbers, physical addresses, URLs, dates, account numbers & secrets (passwords, API keys) β€” covering virtually all common sensitive data types

            Link Preview Image
            newsgroup@social.vir.groupN This user is from outside of this forum
            newsgroup@social.vir.groupN This user is from outside of this forum
            newsgroup@social.vir.group
            wrote last edited by
            #5

            @michabbb Finally, a use for local LLMs.

            1 Reply Last reply
            1
            0
            • System shared this topic
            Reply
            • Reply as topic
            Log in to reply
            • Oldest to Newest
            • Newest to Oldest
            • Most Votes


            • Login

            • Login or register to search.
            • First post
              Last post
            0
            • Categories
            • Recent
            • Tags
            • Popular
            • World
            • Users
            • Groups