概览

AI工程化开源组织

目标¶

致力于搭建AI模型从学术界到工程界之间的桥梁，解决AI模型应用的最后一公里。

🏗️Project	⭐️Stars	📝Description
LLM-DOC		大模型研究院资料馆
LLM-EXAM		大模型中文测试题库-民间版本

🏗️Project	⭐️Stars	📝Description
Knowledge-QA-LLM		QA based on local knowledge and LLM.

Program Language	🏗️Project	📝Description
Python	🔥RapidOCR	Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVION and PaddlePaddle. （将PaddleOCR模型做了转换，采用ONNXRuntime推理，速度很快）
C++	RapidOcrOnnx	rapidocr onnx cpp
C++	RapidOcrNcnn	RapidOCR ncnn 推理
Java	RapidOcr—Java	Java代码实现调用RapidOCR(基于PaddleOCR)，适配Mac、Win、Linux，支持最新PP-OCRv4
Java	RapidOcrOnnxJvm	RapidOcr onnx java kotlin jni test
Java	RapidOcrNcnnJvm	RapidOcr ncnn java kotlin jni
C#	RapidOCRCSharp	本项目为Windows平台C# WinForm范例
Android	RapidOcrAndroidOnnx	RapidOcr onnxruntime推理 for Android
Android	RapidOcrAndroidOnnxCompose	opencv onnxruntime ocr android demo, jetpack compose + kotlin
易语言	RapidOCRServer	RapidOCR HTTP 服务。win32程序，易语言编写

🏗️Project	⭐️Stars	📝Description
RapidTableDetection		检测和提取各种场景图片中的表格区域，并纠正透视和旋转问题 Detect and extract table regions from images in various scenarios, and correct perspective and rotation issues.
RapidUnWrap		修正文档扭曲，使用onnx模型简单轻量部署，未来持续跟进最新最好的文档矫正方案和模型,Correct document distortion using a lightweight ONNX model for easy deployment. We will continue to follow and integrate the latest and best document correction solutions and models in the future.
🔥RapidDoc		针对文档类图像做内容提取，将文档类图像一比一输出到Word或者Txt中，便于进一步使用或处理。后续计划支持输入PDF/图像，输出对应json格式、Txt格式、Word格式和Markdown格式。
RapidOrientation		文档方向分类
RapidLaTexOCR		公式识别:Formula recognition based on LaTeX-OCR and ONNXRuntime.
RapidTable		源自PP-Structure的表格识别算法，模型转换为ONNX，推理引擎采用ONNXRuntime，部署简单，无内存泄露问题。
TableStructureRec		整理目前开源的最优表格识别模型，完善前后处理，模型转换为ONNX
RapidLayout		Analysis of Chinese and English layouts 中英文版面分析
RapidOCRPDF		PDF提取: Based on RapidOCR, extract the PDF content.
ExtractOfficeContent		Office内容提取

🏗️Project	⭐️Stars	📝Description
RapidASR		商用级开源语音自动识别程序库，开箱即用，全平台支持，中英文混合识别。A Cross-platform implementation of ASR inference. It's based on ONNXRuntime and FunASR. We provide a set of easier APIs to call ASR models.
RapidTTS		A cross platform implementation of Text-to-Speech based on ONNXRuntime.
RapidVoice		The engineering implementation of SenseVoice (from Alibaba)
RapidPunc		A library for adding punctuation into a text from ASR.
paraformer_simple		paraformer 语音识别模型的简化推理代码
RapidAudioKit		It's for the repository of audio resampling tools
RapidVAD		A library for splitting audio for ASR.
RapidTP-Aligns		语音的时间戳预测