fraud

Pronunciation: /frɔːd/ (FRAWD)

Simplified Synthetic Data

fraud is a Python package designed to streamline synthetic data generation for fine-tuning machine learning models.

Data scarcity is a limiting factor. Real data is the ideal solution, but collecting it is often expensive, time-consuming, and resource-intensive.

Synthetic data offers an effective middle ground: supplementing a small dataset with generated samples can significantly improve model performance.

Usage

Here's a basic example to get you started.

import fraud as fr

synthetic_samples = fr.from_str('Could you please meet {name} at {time}', 20)
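
To make the idea of template expansion concrete, here is a minimal standard-library sketch of what filling a template like the one above could involve. It is not fraud's implementation: `VALUE_POOLS`, `placeholders`, and `fill_template` are hypothetical names invented for this illustration, and fraud may source its values in a completely different way.

```python
import random
from string import Formatter

# Hypothetical value pools for illustration only; not part of fraud.
VALUE_POOLS = {
    "name": ["Alice", "Bashir", "Chen"],
    "time": ["9am", "noon", "4:30pm"],
}


def placeholders(template: str) -> list[str]:
    """Return the named fields of a str.format-style template, in order."""
    return [field for _, field, _, _ in Formatter().parse(template) if field]


def fill_template(template: str, n: int, seed: int = 0) -> list[str]:
    """Build n strings by substituting a random pool value for each field."""
    rng = random.Random(seed)  # seeded so repeated runs give the same samples
    fields = placeholders(template)
    return [
        template.format(**{f: rng.choice(VALUE_POOLS[f]) for f in fields})
        for _ in range(n)
    ]


samples = fill_template("Could you please meet {name} at {time}", 20)
```

The sketch only shows the mechanics of a `{placeholder}` template: find the fields, pick a value for each, and format the string once per requested sample.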

Predicting Templates

Grab a sample from your dataset to make a template from it!

import fraud as fr

predicted_template = fr.predict_template(
    sample='My name is Trevor and I am a Data Scientist.',
    labels=['name','job'],
    threshold=0.5
)

fr.from_str(predicted_template, 5)
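
The README does not show what `predict_template` returns. Since the result is passed straight to `fr.from_str`, it is presumably a template string with `{label}` placeholders. The sketch below illustrates that shape under a strong assumption: the labeled spans are already known, whereas fraud predicts them from the sample. `to_template` is a hypothetical helper, not part of fraud.

```python
def to_template(sample: str, spans: dict[str, str]) -> str:
    """Replace the first occurrence of each labeled span with a {label} placeholder.

    Assumes the sample contains no literal braces; those would need escaping
    before the result could be used with str.format.
    """
    template = sample
    for label, text in spans.items():
        if text not in template:
            raise ValueError(f"span {text!r} not found in sample")
        template = template.replace(text, "{" + label + "}", 1)
    return template


# Spans are supplied by hand here; this is the part fraud would predict.
template = to_template(
    "My name is Trevor and I am a Data Scientist.",
    {"name": "Trevor", "job": "Data Scientist"},
)
print(template)  # My name is {name} and I am a {job}.
```

A template in this form can then be filled with new values for `name` and `job` to produce fresh samples with the same sentence structure.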


TrevorW-code/fraud