Ultrafeedback default structured output #876

plaguss · 2024-08-09T10:46:39Z

Description

Adds a default structured output

from distilabel.steps.tasks import UltraFeedback
from distilabel.llms.huggingface import InferenceEndpointsLLM

# Consider this as a placeholder for your actual LLM.
ultrafeedback = UltraFeedback(
    llm=InferenceEndpointsLLM(
        model_id="meta-llama/Meta-Llama-3.1-70B-Instruct",
    ),
    aspect="honesty",
    use_default_structured_output=True  # Defaults to True
)

ultrafeedback.load()

result = next(
    ultrafeedback.process(
        [
            {
                "instruction": "How much is 2+2?",
                "generations": ["4", "and a car"],
            }
        ]
    )
)
# result
# [{'instruction': 'How much is 2+2?',
# 'generations': ['4', 'and a car'],
# 'ratings': [5, 1],
# 'rationales': ['The response is correct and confident, as it directly answers the question without expressing any uncertainty or doubt.',
# "The response is confidently incorrect, as it provides unrelated information ('a car') and does not address the question. The model shows no uncertainty or indication that it does not know the answer."],
# 'distilabel_metadata': {'raw_output_ultra_feedback_0': '{"ratings": [\n    5,\n    1\n] \n\n,"rationales": [\n    "The response is correct and confident, as it directly answers the question without expressing any uncertainty or doubt.",\n    "The response is confidently incorrect, as it provides unrelated information (\'a car\') and does not address the question. The model shows no uncertainty or indication that it does not know the answer."\n] }'},
# 'model_name': 'meta-llama/Meta-Llama-3.1-70B-Instruct'}]

Closes #874, must be merged after #873

…schemas

…wasn't overriden

…/distilabel into complexity-scorer-default-structured-output

…/github.com/argilla-io/distilabel into quality-scorer-default-structured-output

github-actions · 2024-08-09T10:48:12Z

Documentation for this PR has been built. You can view it at: https://distilabel.argilla.io/pr-876/

…to ultrafeedback-default-structured-output

plaguss added 16 commits August 8, 2024 12:59

Add default structured output for GenerateSentencePair task

d774138

Move default behavior to base class

a0cf5ff

Add docstrings to the methods and move json schemas to the class method

9054570

Add tests for default structured outputs in sentence transformers task

426a65d

Add control for parsing errors on JSON data

0c42450

Add default structured output for ComplexityScorer task

d1ee63f

Add default structured output for QualityScorer task

0095f92

Add example to the docstrings

0b3286f

Refactor code per code review, to simplify just creating the default …

4ddef83

…schemas

Add extra check to avoid setting the structured output if the method …

2aaf51a

…wasn't overriden

Merge branch 'set_structured_output' of https://github.com/argilla-io…

2974db3

…/distilabel into complexity-scorer-default-structured-output

Refactor get_structured_output to return just the schema

e4ff5a0

Add reference for the JSON schema

53bda44

Merge branch 'complexity-scorer-default-structured-output' of https:/…

00dc5ed

…/github.com/argilla-io/distilabel into quality-scorer-default-structured-output

Refactor get_structured_output to return just the schema

bc6c5ea

Add default structured output for UltraFeedback task

fb8de4b

plaguss added the enhancement New feature or request label Aug 9, 2024

plaguss added this to the 1.4.0 milestone Aug 9, 2024

plaguss requested a review from gabrielmbmb August 9, 2024 10:46

plaguss self-assigned this Aug 9, 2024

plaguss linked an issue Aug 9, 2024 that may be closed by this pull request

[FEATURE] UltraFeedback predefined structured output #874

Closed

Merge branch 'develop' of https://github.com/argilla-io/distilabel in…

b43a6af

…to ultrafeedback-default-structured-output

plaguss merged commit c006ddc into develop Aug 9, 2024
0 of 5 checks passed

plaguss deleted the ultrafeedback-default-structured-output branch August 9, 2024 10:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ultrafeedback default structured output #876

Ultrafeedback default structured output #876

plaguss commented Aug 9, 2024

github-actions bot commented Aug 9, 2024

Ultrafeedback default structured output #876

Ultrafeedback default structured output #876

Conversation

plaguss commented Aug 9, 2024

Description

github-actions bot commented Aug 9, 2024