
OpenDataLab
@OpenDataLab_AI
Followers
156
Following
21
Media
24
Statuses
42
Joined February 2023
Based on the Classification System, we divided #WanJuanSiLu into 7 major categories, covering a wide range of content with characteristics of the language's geographic location, such as #history, #politics, #culture, #shopping, #encyclopedic knowledge.
0
0
0
Beyond basic reasoning, REST specifically evaluates several under-tested capabilities: contextual priority allocation 🗂️, cross-problem interference resistance ⚖️, and dynamic cognitive load management⚙️. Paper link:
arxiv.org
Recent Large Reasoning Models (LRMs) have achieved remarkable progress on task-specific benchmarks, yet their evaluation methods remain constrained by isolated problem-solving paradigms. Existing...
0
0
0
#MathFusion is a novel framework that enhances mathematical reasoning through cross-problem instruction synthesis. 🦾Experimental results demonstrate that it achieves substantial improvements in mathematical reasoning while maintaining high data efficiency. #AI #Datasets
0
0
0
@CherryStudioHQ @cursor_ai This architecture enables any #AI tool supporting MCP protocol to easily integrate and leverage MinerU's document processing capabilities.📲.
0
0
0
🔍 MinerU MCP Server source code released! It accepts commands from #MCP protocol-supported clients (e.g., @CherryStudioHQ , @cursor_ai ), invokes MinerU API for actual conversion, and returns results to clients.🚀.Get MinerU MCP Server source code from:
1
0
0
#MinerU has officially cooperated with @CherryStudioHQ. You can directly call the MinerU function in Cherry Studio. MinerU officially provides each Cherry Studio user with a document processing quota of up to 500 pages per day.
1
0
6
#OmniDocBench has been accepted by #CVPR 2025! OmniDocBench is a benchmark for evaluating diverse document parsing in real-world scenarios. 🤓We conducted an evaluation of current mainstream PDF parsing tools using OmniDocBench, and the results are as follows.
1
1
0
#MinerU leverages the sophisticated PDF-Extract-Kit models to extract content from diverse documents effectively and ensure the accuracy of the final results. As its core, MinerU commits to facilitating the #mathematical and extended formulas parsing.
0
0
0
RT @AndrewYNg: Agentic Document Extraction just got much faster! From previous 135sec median processing time down to 8sec. Extracts not jus….
0
609
0
We are very pleased to know that one of our users just launched a website about #MinerU! The website has deployed open-source solutions for data processing, tutoring, sharing of usage experience, etc. Welcome to join the community :
0
0
0
Document content analysis has been a crucial research area in computer vision. We present #MinerU, an open-source solution for high-precision document content extraction. Deep dive into MinerU via the technical report:
0
0
1
MinerU Dify Plugin has been launched on Dify Marketplace. The plugin was jointly developed by MinerU and @dify_ai . From now on, you can use it to set up workflow on Dify so that you can parse complex document data for any downstream LLM use case with high efficiency.
1
0
1