NetEaseCrowd dataset, a collection of data obtained from You Ling crowdsourcing platform, Fuxi AI Lab, NetEase.
-
Updated
Apr 1, 2024
NetEaseCrowd dataset, a collection of data obtained from You Ling crowdsourcing platform, Fuxi AI Lab, NetEase.
Implementation of data typology for imbalanced datasets.
Quilt: Robust Data Segment Selection against Concept Drifts (AAAI 2024)
Find illustrations in historic book using computer vision
Embrace limited and imperfect training datasets in plant disease recognition using deep learning.
Compute clustering on your data in a visual, intuitive way with FiftyOne and Sklearn!
You can’t handle the (dirty) truth: Data-centric insights improve pseudo-labeling
Filter a float-valued field on two ranges simultaneously with this FiftyOne Plugin!
Repository for the Data-Centric AI Competition
Cleanlab and MachineHack Organised Data-Centric AI Competition 2023. This is One of Solution I tried and achieved 13th rank.
Lab assignments for Introduction to Data-Centric AI, MIT IAP 2023 👩🏽💻
Semantically search through OCR text blocks with Qdrant, Sentence Transformers, and FiftyOne!
🧼🔎 SelfClean revised versions of benchmark datasets for more reliable performance estimation.
Applying various data engineering techniques into image classification task for KAIST DS801 term project
Course of data centric AI. ITMO University. AI Talent Hub.
📕 flyswot book on developing a pragmatic machine learning workflow in a library setting
A multi-view panorama of Data-Centric AI: Techniques, Tools, and Applications (ECAI Tutorial 2024)
Customer churn train/prediction library with automatic dataset size optimisation features.
Hugging Face Plugins for FiftyOne
Add a description, image, and links to the data-centric-ai topic page so that developers can more easily learn about it.
To associate your repository with the data-centric-ai topic, visit your repo's landing page and select "manage topics."