Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/
Large-scale pretraining for dialogue
A light-weight, flexible, and expressive statistical data testing library
Lineage metadata API, artifacts streams, sandbox, API, and spaces for Polyaxon
Use this template repository to write projects and tenders data ingestion pipelines
Extract, Transform, Load (ETL) for Python 3.5+
Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: http://casl-project.ai/
Large-scale pretrained models for goal-directed dialog
Predict the Power Production of a solar panel farm from Weather Measurements using Machine Learning
A multi-cloud framework for big data analytics and embarrassingly parallel jobs that provides a universal API for building parallel applications in the cloud ☁️🚀
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
Manipulating VASP files with Python.
Data and tools for generating and inspecting OLMo pre-training data.
All-in-one text de-duplication
Open pixelated STEM framework
An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All components are containerized with Docker for easy deployment and scalability.
Forte is a flexible and powerful ML workflow builder. This is part of the CASL project: http://casl-project.ai/
Python Stream Processing
VIP is a Python package/library for angular, reference star, and spectral differential imaging for exoplanet/disk detection through high-contrast imaging.
Distributed tensors and Machine Learning framework with GPU and MPI acceleration in Python