Open-Source Nepali Health-QA Language Model with FineTuned Transformers

license

datasets

language

metrics

pipeline_tag

Open-Source Nepali Health-QA Language Model with FineTuned Transformers

MT5-small is finetuned with large corups of Nepali Health Question-Answering Dataset.

Introduction

In the ever-evolving landscape of Natural Language Processing (NLP), our project, titled ”OPEN-SOURCE NEPALI HEALTH-QA LANGUAGE MODEL WITH FINE-TUNED TRANSFORMERS AND EXTERNAL KNOWLEDGE BASES,” represents a dedicated effort to address the pressing need for accessible and accurate health-related information in the Nepali language. This project is driven by the recognition of the critical role that natural language understanding plays in fostering meaningful interactions, particularly in the domain of health-related inquiries. Our question-answering model doesn’t just answer queries; it provides a valuable second opinion, offering additional perspectives and comprehensive insights on healthcare matters.

Training Procedure

The model was trained for more than 125 epochs with the following training parameters:

Learning Rate: 2e-4

Batch Size: 2

Gradient Accumulation Steps: 8

FP16 (mixed-precision training): Disabled

Optimizer: Adafactor

The training loss consistently decreased, indicating successful learning.

Use Case

  !pip install transformers sentencepiece

  from transformers import MT5ForConditionalGeneration, AutoTokenizer 
  # Load the trained model
  model = MT5ForConditionalGeneration.from_pretrained("Chhabi/mt5-small-finetuned-Nepali-Health-50k-2")
  
  # Load the tokenizer for generating new output
  tokenizer = AutoTokenizer.from_pretrained("Chhabi/mt5-small-finetuned-Nepali-Health-50k-2",use_fast=True)


    
  query = "म धेरै थकित महसुस गर्छु र मेरो नाक बगिरहेको छ। साथै, मलाई घाँटी दुखेको छ र अलि टाउको दुखेको छ। मलाई के भइरहेको छ?"
  input_text = f"answer: {query}"
  inputs = tokenizer(input_text,return_tensors='pt',max_length=256,truncation=True).to("cuda")
  print(inputs)
  generated_text = model.generate(**inputs,max_length=512,min_length=256,length_penalty=3.0,num_beams=10,top_p=0.95,top_k=100,do_sample=True,temperature=0.7,num_return_sequences=3,no_repeat_ngram_size=4)
  print(generated_text)
  # generated_text
  generated_response = tokenizer.batch_decode(generated_text,skip_special_tokens=True)[0]
  tokens = generated_response.split(" ")
  filtered_tokens = [token for token in tokens if not token.startswith("<extra_id_")]
  print(' '.join(filtered_tokens))

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
README.md		README.md
mt-5-finetuned.ipynb		mt-5-finetuned.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Open-Source Nepali Health-QA Language Model with FineTuned Transformers

Table of Contents

Introduction

Training Procedure

Use Case

Evaluation

BLEU score:

Inference from finetuned model:

FineTune

How to finetune your model for your custom datasets?

About

Releases

Packages

Languages

Chhabii/FinetuneMT5-Nepali-Health-Chat

Folders and files

Latest commit

History

Repository files navigation

Open-Source Nepali Health-QA Language Model with FineTuned Transformers

Table of Contents

Introduction

Training Procedure

Use Case

Evaluation

BLEU score:

Inference from finetuned model:

FineTune

How to finetune your model for your custom datasets?

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages