Skip to content
@git-disl

git-disl

Pinned Loading

  1. PokeLLMon PokeLLMon Public

    Python 190 15

Repositories

Showing 10 of 72 repositories
  • Antidote Public

    This is the unofficial re-implementation of "Antidote: Post-fine-tuning Safety Alignment for Large Language Models against Harmful Fine-tuning Attack" (ICML2025)

    git-disl/Antidote’s past year of commit activity
    Shell 1 0 0 0 Updated Jul 14, 2025
  • Fusion-Shot Public
    git-disl/Fusion-Shot’s past year of commit activity
    Jupyter Notebook 1 0 0 0 Updated Jul 13, 2025
  • awesome_LLM-harmful-fine-tuning-papers Public

    A survey on harmful fine-tuning attack for large language model

    git-disl/awesome_LLM-harmful-fine-tuning-papers’s past year of commit activity
    195 6 0 0 Updated Jul 1, 2025
  • GTLLMZoo Public

    GTLLMZoo: A comprehensive framework that aggregates LLM benchmark data from multiple sources with an interactive UI for efficient model comparison, filtering, and evaluation across performance, safety, and efficiency metrics.

    git-disl/GTLLMZoo’s past year of commit activity
    Python 3 0 0 0 Updated Jun 12, 2025
  • awesome-LLM-game-agent-papers Public

    A Survey on Large Language Model-Based Game Agents

    git-disl/awesome-LLM-game-agent-papers’s past year of commit activity
    661 23 0 0 Updated Apr 30, 2025
  • Booster Public

    This is the official code for the paper "Booster: Tackling Harmful Fine-tuning for Large Language Models via Attenuating Harmful Perturbation" (ICLR2025 Oral).

    git-disl/Booster’s past year of commit activity
    Shell 29 Apache-2.0 1 1 0 Updated Mar 22, 2025
  • Safety-Tax Public

    This is the official code for the paper "Safety Tax: Safety Alignment Makes Your Large Reasoning Models Less Reasonable".

    git-disl/Safety-Tax’s past year of commit activity
    Python 21 Apache-2.0 0 1 0 Updated Mar 11, 2025
  • Virus Public

    This is the official code for the paper "Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation"

    git-disl/Virus’s past year of commit activity
    Python 50 Apache-2.0 3 0 0 Updated Feb 2, 2025
  • llm-topla Public
    git-disl/llm-topla’s past year of commit activity
    Jupyter Notebook 6 1 1 0 Updated Jan 2, 2025
  • PFT Public
    git-disl/PFT’s past year of commit activity
    Python 1 0 0 0 Updated Dec 6, 2024
pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy