Skip to content

Commit f4fac1e

Browse files
author
ASE Bot
committed
chore: strengthen legal ops industry collection
1 parent b6aca22 commit f4fac1e

3 files changed

Lines changed: 20 additions & 2 deletions

File tree

industries/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@
77
| 🎙️ | [**Media & Publishing Systems**](media-publishing-systems.md) | 11 | Transcription, subtitles, podcast workflows, chaptering, localization, loudness cleanup, and final-mile publishing prep. |
88
| 💼 | [**Finance & Filings**](finance-filings.md) | 10 | Filings research, invoice intake, billing operations, reconciliation, and finance-adjacent reporting. |
99
| 🛒 | [**Ecommerce & Retail Operations**](ecommerce-retail-operations.md) | 11 | Catalog management, storefront automation, orders, inventory sync, marketplace support, and review-driven merchandising. |
10-
| ⚖️ | [**Legal Ops & Compliance**](legal-ops-compliance.md) | 12 | Contract workflows, forms, document review, archive search, and evidence-oriented legal and compliance support. |
10+
| ⚖️ | [**Legal Ops & Compliance**](legal-ops-compliance.md) | 21 | Contract workflows, forms, document review, archive search, and evidence-oriented legal and compliance support. |
1111
| 🩺 | [**Healthcare Documentation & Intake**](healthcare-documentation-intake.md) | 11 | Documentation intake, OCR, transcription, structured extraction, and biomedical literature support for paperwork-heavy workflows. |
1212
| 📈 | [**Product Analytics & Growth Ops**](product-analytics-growth-ops.md) | 10 | Product analytics, feature flags, rollout checks, session replay, privacy-friendly web analytics, and experiment/evaluation workflows. |
1313
| 📚 | [**DevRel & API Documentation**](devrel-api-documentation.md) | 14 | API docs, OpenAPI references, SDK generation, docs-site publishing, prose linting, and developer enablement workflows. |

industries/legal-ops-compliance.md

Lines changed: 9 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -12,15 +12,24 @@ Contract workflows, forms, document review, archive search, and evidence-oriente
1212
| [Documenso Open Source Document Signing Platform](../skills/documenso-open-source-document-signing/) | Calendar, Email & Productivity | 12.6k ||
1313
| [DocuSeal Open Source Document Signing and PDF Form Platform](../skills/docuseal-document-signing-pdf-forms/) | Templates & Workflows | 11.7k ||
1414
| [OCRmyPDF Searchable PDF OCR Pipeline](../skills/ocrmypdf-searchable-pdf-ocr-pipeline/) | Media & Transcription | 33.2k ||
15+
| [Apache Tika Document Extractor](../skills/apache-tika-document-extractor/) | Data Extraction & Transformation | 3.7k ||
16+
| [Apache Tika Document Parser](../skills/apache-tika-document-parser/) | Data Extraction & Transformation | 3.7k ||
1517
| [pdfplumber Python PDF Text and Table Extraction Library](../skills/pdfplumber-python-pdf-text-table-extraction/) | Data Extraction & Transformation | 10.1k ||
18+
| [Parse local PDFs into agent-ready text, JSON, and screenshots with LiteParse](../skills/parse-local-pdfs-into-agent-ready-text-json-and-screenshots-with-liteparse/) | Data Extraction & Transformation | 5.1k | 37k/wk |
1619
| [Search PDFs, Office files, ebooks, and archives with one query before manual review](../skills/search-pdfs-office-files-ebooks-and-archives-with-one-query-before-manual-review/) | Research & Scraping | 9.6k ||
1720
| [Paperless-ngx Document OCR and Archive Management System](../skills/paperless-ngx-document-ocr-archive-management-system/) | Data Extraction & Transformation | 38.1k ||
1821
| [LangExtract LLM-Powered Structured Text Extraction](../skills/langextract-llm-structured-text-extraction/) | Data Extraction & Transformation | 35k ||
22+
| [Redact PII from text before sharing or indexing with scrubadub](../skills/redact-pii-from-text-before-sharing-or-indexing-with-scrubadub/) | Security & Verification | 421 ||
1923
| [Search large PDFs and read only the relevant pages before answering](../skills/search-large-pdfs-and-read-only-the-relevant-pages-before-answering/) | Data Extraction & Transformation | 17 ||
2024
| [Process, redact, OCR, and sign documents with Nutrient Agent Skill](../skills/process-redact-ocr-and-sign-documents-with-nutrient-agent-skill/) | Data Extraction & Transformation | 5 ||
2125
| [Convert dense PDFs into LLM-ready text and page-aligned markdown with olmOCR](../skills/convert-dense-pdfs-into-llm-ready-text-and-page-aligned-markdown-with-olmocr/) | Data Extraction & Transformation | 17.1k ||
26+
| [Turn documents into validated knowledge graphs with Docling Graph](../skills/turn-documents-into-validated-knowledge-graphs-with-docling-graph/) | Data Extraction & Transformation | 134 ||
27+
| [Extract structured markdown, JSON, and tagged-PDF-ready outputs from PDFs with OpenDataLoader PDF](../skills/extract-structured-markdown-json-and-tagged-pdf-ready-outputs-from-pdfs-with-opendataloader-pdf/) | Data Extraction & Transformation | 19.1k ||
2228
| [Enrich Paperless-ngx documents with AI-generated titles tags and correspondents using paperless-gpt](../skills/enrich-paperless-ngx-documents-with-ai-generated-titles-tags-and-correspondents-using-paperless-gpt/) | Data Extraction & Transformation | 2.3k ||
2329
| [Capture a live webpage as a clean PDF or readable archive for offline review with Percollate](../skills/capture-a-live-webpage-as-a-clean-pdf-or-readable-archive-for-offline-review-with-percollate/) | Research & Scraping | 4.6k | 588/wk |
30+
| [Extract structured data and attachments from raw email with MailParser](../skills/extract-structured-data-and-attachments-from-raw-email-mailparser/) | Calendar, Email & Productivity | 1.7k ||
31+
| [Strip quoted email history and signatures before summarizing inbound replies](../skills/strip-quoted-email-history-and-signatures-before-summarizing-inbound-replies/) | Calendar, Email & Productivity | 78 ||
32+
| [Load .mbox mail archives into SQLite for offline search, audits, and dataset joins](../skills/load-mbox-mail-archives-into-sqlite-for-offline-search-audits-and-dataset-joins/) | Calendar, Email & Productivity | 39 ||
2433

2534
## Editorial Caution
2635

scripts/industry-collections.json

Lines changed: 10 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -69,15 +69,24 @@
6969
"documenso-open-source-document-signing",
7070
"docuseal-document-signing-pdf-forms",
7171
"ocrmypdf-searchable-pdf-ocr-pipeline",
72+
"apache-tika-document-extractor",
73+
"apache-tika-document-parser",
7274
"pdfplumber-python-pdf-text-table-extraction",
75+
"parse-local-pdfs-into-agent-ready-text-json-and-screenshots-with-liteparse",
7376
"search-pdfs-office-files-ebooks-and-archives-with-one-query-before-manual-review",
7477
"paperless-ngx-document-ocr-archive-management-system",
7578
"langextract-llm-structured-text-extraction",
79+
"redact-pii-from-text-before-sharing-or-indexing-with-scrubadub",
7680
"search-large-pdfs-and-read-only-the-relevant-pages-before-answering",
7781
"process-redact-ocr-and-sign-documents-with-nutrient-agent-skill",
7882
"convert-dense-pdfs-into-llm-ready-text-and-page-aligned-markdown-with-olmocr",
83+
"turn-documents-into-validated-knowledge-graphs-with-docling-graph",
84+
"extract-structured-markdown-json-and-tagged-pdf-ready-outputs-from-pdfs-with-opendataloader-pdf",
7985
"enrich-paperless-ngx-documents-with-ai-generated-titles-tags-and-correspondents-using-paperless-gpt",
80-
"capture-a-live-webpage-as-a-clean-pdf-or-readable-archive-for-offline-review-with-percollate"
86+
"capture-a-live-webpage-as-a-clean-pdf-or-readable-archive-for-offline-review-with-percollate",
87+
"extract-structured-data-and-attachments-from-raw-email-mailparser",
88+
"strip-quoted-email-history-and-signatures-before-summarizing-inbound-replies",
89+
"load-mbox-mail-archives-into-sqlite-for-offline-search-audits-and-dataset-joins"
8190
]
8291
},
8392
{

0 commit comments

Comments
 (0)