SHUzhangshuo

Highlights

  • Pro


rulelift: a Python package for evaluating and optimizing rule-based strategies (stars and PRs welcome). Rulelift is a Python toolkit designed for strategy rule effectiveness analysis and automatic rule mining.

Python 79 19 Updated May 10, 2026

Web Structured Data Extraction Agent

HTML 15 6 Updated Mar 10, 2026

WebMainBench is a high-precision benchmark for evaluating web main content extraction.

Python 16 11 Updated Jun 13, 2026

MinerU-HTML: An SLM-powered HTML main content extractor that outputs clean HTML bodies. Perfect for Deep Research Agents, RAG applications, and training data generation.

Python 260 26 Updated Mar 27, 2026

A Python package for MinerU document processing and RAG (Retrieval-Augmented Generation) knowledge base construction.

Python 1 Updated Nov 25, 2025

Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.

Python 67,665 5,697 Updated Jun 15, 2026

An intelligent paper-reading assistant built on MinerU, providing PDF document parsing, OCR, table extraction, and other features.

Python 19 3 Updated Dec 2, 2025

data-find-questions

HTML 1 Updated Dec 7, 2025

Dingo: A Comprehensive AI Data Quality Evaluation Tool

JavaScript 1 Updated Aug 11, 2025

A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF into Markdown and JSON formats.

Python 1 Updated Dec 4, 2025

Label Studio is a multi-type data labeling and annotation tool with standardized output format

JavaScript 1 Updated Aug 19, 2025
Python 1 Updated Sep 12, 2025

Dingo: A Comprehensive AI Data, Model and Application Quality Evaluation Tool

Python 713 72 Updated Jun 16, 2026