Commit

fix compilation bugs in dataset.cpp
huangzhengxiang authored Dec 15, 2024
1 parent 47027e1 commit 460bf40
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions transformers/llm/engine/src/dataset.cpp
@@ -209,8 +209,8 @@ std::vector<std::vector<std::vector<PromptItem>>> shareGPT(std::string prompt_fi
     if (sample_size > 0 && sample_size < dialogs.size()){
         std::random_device rd;
         std::mt19937 g(rd());
-        std::sample(dialogs.begin(), dialogs.end(), std::back_inserter(dataset),
-                    sample_size, g);
+        std::shuffle(dialogs.begin(), dialogs.end(), g);
+        dataset.insert(dataset.end(), dialogs.begin(), dialogs.begin() + sample_size);
         dialogs = dataset;
         // store dialogs to file
         write_jsonl(genSampleName(prompt_file, sample_size), dialogs);
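
The hunk above swaps std::sample, which was added to <algorithm> in C++17, for std::shuffle followed by a range insert, both available since C++11. This presumably lets the file build on toolchains without C++17 support; the commit message only says "compilation bugs", so treat that reading as an assumption. As a minimal sketch of the same shuffle-then-take idea in isolation, the helper below (sample_by_shuffle is a hypothetical name, not part of dataset.cpp) draws k elements without replacement:

```cpp
#include <algorithm>
#include <cstddef>
#include <random>
#include <vector>

// Hypothetical helper, not taken from dataset.cpp: pick `k` elements without
// replacement by shuffling a copy of the input and keeping its first `k` entries.
template <typename T>
std::vector<T> sample_by_shuffle(std::vector<T> items, std::size_t k, std::mt19937& gen) {
    if (k >= items.size()) {
        return items;  // asked for everything (or more): return the copy unchanged
    }
    // A uniform shuffle makes every k-element prefix equally likely.
    std::shuffle(items.begin(), items.end(), gen);
    // Drop everything after the first k elements.
    items.erase(items.begin() + static_cast<std::ptrdiff_t>(k), items.end());
    return items;
}
```

Two differences from std::sample may matter to callers. The result comes back in shuffled order, whereas std::sample keeps the relative order of the selected elements when given forward iterators. And the version in the commit shuffles dialogs in place rather than a copy, which is harmless there only because dialogs is overwritten with dataset on the next line.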