Skip to content
/ MPSC Public

[ACL 2024] Enhancing Large Language Models in Coding Through Multi-Perspective Self-Consistency

License

Notifications You must be signed in to change notification settings

skpig/MPSC

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Enhancing Large Language Models in Coding Through Multi-Perspective Self-Consistency [ACL 2024]

Environment

  1. Install required package: pip install -r requirements.txt
  2. Download benchmark dataset from google drive to data dir
  3. Download auther generated outputs from google drive [Available Soon!] to runtime dir
  4. Update api.py to your own OpenAI config

Directory Structure

|-- data # four code generation datasets
|-- runtime # runtime files including LLM generated results and inter-consistency measurements
|-- src
    |-- pipeline.py # the entry point for LLM sampling & inter-consistency measurements. All results will be saved in `runtime`.
    |-- graph.py # the entry point of MPSC
    |-- evaluation.py, _evaluation.py # evaluation metrics
    |-- execution.py, _execution.py # execution process for inter-consistency measurements
    |-- api.py # OpenAI api 
    |-- exemplars # ICL exemplars for test case generation

Reproduction

  • Directly apply author provided LLM generated results for MPSC
    python3 graph.py
    
  • MPSC from scratch (Warning: may cause a large number of OpenAI API calls)
    python3 pipeline.py
    python3 graph.py
    

Usage of MPSC

We also provide a code snippet of MPSC for other tasks in MPSC dir.

About

[ACL 2024] Enhancing Large Language Models in Coding Through Multi-Perspective Self-Consistency

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy