Skip to content
@git-disl

git-disl

Pinned Loading

  1. PokeLLMon PokeLLMon Public

    Python 178 15

Repositories

Showing 10 of 70 repositories
  • Booster Public

    This is the official code for the paper "Booster: Tackling Harmful Fine-tuning for Large Language Models via Attenuating Harmful Perturbation" (ICLR2025 Oral).

    git-disl/Booster’s past year of commit activity
    Shell 19 Apache-2.0 0 0 0 Updated Mar 19, 2025
  • GTLLMZoo Public

    GTLLMZoo: A comprehensive framework that aggregates LLM benchmark data from multiple sources with an interactive UI for efficient model comparison, filtering, and evaluation across performance, safety, and efficiency metrics.

    git-disl/GTLLMZoo’s past year of commit activity
    Python 1 0 0 0 Updated Mar 14, 2025
  • Safety-Tax Public

    This is the official code for the paper "Safety Tax: Safety Alignment Makes Your Large Reasoning Models Less Reasonable".

    git-disl/Safety-Tax’s past year of commit activity
    Python 11 Apache-2.0 0 0 0 Updated Mar 11, 2025
  • awesome_LLM-harmful-fine-tuning-papers Public

    A survey on harmful fine-tuning attack for large language model

    git-disl/awesome_LLM-harmful-fine-tuning-papers’s past year of commit activity
    149 3 0 0 Updated Mar 7, 2025
  • awesome-LLM-game-agent-papers Public

    A Survey on Large Language Model-Based Game Agents

    git-disl/awesome-LLM-game-agent-papers’s past year of commit activity
    529 20 0 0 Updated Mar 4, 2025
  • Virus Public

    This is the official code for the paper "Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation"

    git-disl/Virus’s past year of commit activity
    Python 44 Apache-2.0 3 0 0 Updated Feb 2, 2025
  • llm-topla Public
    git-disl/llm-topla’s past year of commit activity
    Jupyter Notebook 5 0 1 0 Updated Jan 2, 2025
  • PFT Public
    git-disl/PFT’s past year of commit activity
    Python 1 0 0 0 Updated Dec 6, 2024
  • Chameleon Public
    git-disl/Chameleon’s past year of commit activity
    Python 6 1 1 0 Updated Nov 18, 2024
  • Vaccine Public

    This is the official code for the paper "Vaccine: Perturbation-aware Alignment for Large Language Models" (NeurIPS2024)

    git-disl/Vaccine’s past year of commit activity
    Shell 40 Apache-2.0 4 0 0 Updated Nov 18, 2024

Top languages

Loading…

Most used topics

Loading…