RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Updated Jul 5, 2024 - Python
Artificial vision system for detection of people in natural environments from images taken from a drone
A set of data science and machine learning tools
PathoPatcher is a Python project designed to accelerate whole slide image preprocessing. It employs AI-based preprocessing techniques and offers features such as annotation handling, color normalization, and configurable parameters.
preprocS2 is an R package dedicated to basic preprocessing of Sentinel-2 Level-2A reflectance images.
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
The CIFAKE project is a comprehensive effort to develop and implement techniques for distinguishing between AI-generated images and real images. This project leverages a combination of preprocessing techniques, feature extraction, machine learning, and deep learning models to accurately classify images.
Automated Time Series Forecasting
DupliPy is a quick and easy-to-use package that can handle text formatting and data augmentation tasks for NLP in Python. It now offers support for image augmentation tasks as well.
Document preprocessing scripts for the Nature of EU Rules project
An ultra-fast all-in-one FASTQ preprocessor (QC/adapters/trimming/filtering/splitting/merging...)
Culled from the UCI Machine Learning Repository, the Dry Bean Dataset (licensed under CC BY 4.0) offers insights into bean classification and is a useful resource for machine learning enthusiasts.
Analysis-ready CMIP6 data in Python the easy way with Pangeo tools.
The Sentiment Analysis Model is a TensorFlow-based project that predicts text sentiment. Trained on Twitter data, it can be customized with other datasets for personalized sentiment predictions. The repository includes a single Jupyter Notebook with complete code for preprocessing, model training, and prediction.
A Chinese NLP preprocessing and parsing package: accurate, efficient, and easy to use. www.jionlp.com
Dataflow Programming for Machine Learning in R
Strand-Seq Quality Control Pipeline based on ashleys-qc
Sentiment Analysis on Honda customer complaints data using Python and SAS Enterprise Miner as part of a school project