ASE2023

Number of papers: 18

A Closer Look at Different Difficulty Levels Code Generation Abilities of ChatGPT

Authors: Yan, Dapeng and Gao, Zhipeng and Liu, Zhiming
Abstract: Code generation aims to generate source code implementing human requirements illustrated with natural language specifications. With the rapid development of intelligent software engineering, automated code generation has become a hot research topic in both artificial intelligence and software engineering, and researchers have made significant achievements on code generation. More recently, large language models (LLMs) have demonstrated outstanding performance on code generation tasks, such as Ch...
Link: Read Paper
Labels: code generation, program synthesis, empirical study

An Empirical Study on Fine-Tuning Large Language Models of Code for Automated Program Repair

Authors: Huang, Kai and Meng, Xiangxin and Zhang, Jian and Liu, Yang and Wang, Wenjie and Li, Shuhao and Zhang, Yuqing
Abstract: The advent of large language models (LLMs) has opened up new opportunities for automated program repair (APR). In particular, some recent studies have explored how to leverage large language models of code (LLMCs) for program repair tasks and show promising results. However, most of them adopt the zero/few-shot learning paradigm for APR, which directly uses LLMCs to generate the possibly correct code given its surrounding context. Though effective, the repair capabilities of LLMCs based on the fi...
Link: Read Paper
Labels: code generation, program repair, empirical study

Better Patching Using LLM Prompting, via Self-Consistency

Authors: Ahmed, Toufique and Devanbu, Premkumar
Abstract: Large Language models (LLMs) can be induced to solve non-trivial problems with “few-shot” prompts including illustrative problem-solution examples. Now if the few-shots also include “chain of thought” ($\mathcal{C}oT$) explanations, which are of the form problem-explanation-solution, LLMs will generate an “explained” solution, and perform even better. Recently an exciting, substantially better technique, self-consistency [1] ($\mathcal{S}-C$) has emerged, based on the intuition that there are man...
Link: Read Paper
Labels: code generation, program repair, prompt strategy

CAT-LM: Training Language Models on Aligned Code and Tests

Authors: Rao, Nikitha and Jain, Kush and Alon, Uri and Le Goues, Claire and Hellendoorn, Vincent J
Abstract: Testing is an integral but often neglected part of the software development process. Classical test generation tools such as EvoSuite generate behavioral test suites by optimizing for coverage, but tend to produce tests that are hard to understand. Language models trained on code can generate code that is highly similar to that written by humans, but current models are trained to generate each file separately, as is standard practice in natural language processing, and thus fail to consider the ...
Link: Read Paper
Labels: code model, code model training, source code model

COMEX: A Tool for Generating Customized Source Code Representations

Authors: Das, Debeshee and Mathews, Noble Saji and Mathai, Alex and Tamilselvam, Srikanth and Sedamaki, Kranthi and Chimalakonda, Sridhar and Kumar, Atul
Abstract: Learning effective representations of source code is critical for any Machine Learning for Software Engineering (ML4SE) system. Inspired by natural language processing, large language models (LLMs) like Codex and CodeGen treat code as generic sequences of text and are trained on huge corpora of code data, achieving state of the art performance on several software engineering (SE) tasks. However, valid source code, unlike natural language, follows a strict structure and pattern governed by the un...
Link: Read Paper
Labels: code generation, source code model

Cell2Doc: ML Pipeline for Generating Documentation in Computational Notebooks

Authors: Mondal, Tamal and Barnett, Scott and Lal, Akash and Vedurada, Jyothi
Abstract: Computational notebooks have become the go-to way for solving data-science problems. While they are designed to combine code and documentation, prior work shows that documentation is largely ignored by the developers because of the manual effort. Automated documentation generation can help, but existing techniques fail to capture algorithmic details and developers often end up editing the generated text to provide more explanation and sub-steps. This paper proposes a novel machine-learning pipel...
Link: Read Paper
Labels: software maintenance and deployment, documentation generation

Domain Adaptive Code Completion via Language Models and Decoupled Domain Databases

Authors: Tang, Ze and Ge, Jidong and Liu, Shangqing and Zhu, Tingwei and Xu, Tongtong and Huang, Liguo and Luo, Bin
Abstract: Large Language Models (LLMs) have demonstrated remarkable performance in code completion. However, due to the lack of domain-specific knowledge, they may not be optimal in completing code that requires intensive domain knowledge, for example completing library names. Although several works have confirmed the effectiveness of fine-tuning techniques to adapt language models for code completion in specific domains, they are limited by the need for constant fine-tuning of the model...
Link: Read Paper
Labels: code generation, code completion, code model, code model training, source code model

From Misuse to Mastery: Enhancing Code Generation with Knowledge-Driven AI Chaining

Authors: Ren, Xiaoxue and Ye, Xinyuan and Zhao, Dehai and Xing, Zhenchang and Yang, Xiaohu
Abstract: Large Language Models (LLMs) have shown promising results in automatic code generation by improving coding efficiency to a certain extent. However, generating high-quality and reliable code remains a formidable task because of LLMs' lack of good programming practice, especially in exception handling. In this paper, we first conduct an empirical study and summarize three crucial challenges of LLMs in exception handling, i.e., incomplete exception handling, incorrect exception handling and abuse o...
Link: Read Paper
Labels: code generation, program synthesis

Generative Type Inference for Python

Authors: Peng, Yun and Wang, Chaozheng and Wang, Wenxuan and Gao, Cuiyun and Lyu, Michael R.
Abstract: Python is a popular dynamic programming language, evidenced by its ranking as the second most commonly used language on GitHub. However, its dynamic type system can lead to potential type errors, leading researchers to explore automatic type inference approaches for Python programs. Existing type inference approaches can be generally grouped into three categories, i.e., rule-based, supervised, and cloze-style approaches. The rule-based type inference approaches can ensure the accuracy of predic...
Link: Read Paper
Labels: static analysis, type inference

Improving Code Extraction from Coding Screencasts Using a Code-Aware Encoder-Decoder Model

Authors: Malkadi, Abdulkarim and Tayeb, Ahmad and Haiduc, Sonia
Abstract: Accurate automatic code extraction from tutorial videos is crucial for software developers seeking to reuse the code contained in these videos. Current methods using optical character recognition (OCR) often yield inaccurate results due to code complexity and variations in screencast formats. To address this issue, we introduce CodeT5-OCRfix, an approach that leverages the pre-trained code-aware large language model CodeT5 to enhance code extraction accuracy by post-processing OCRed code. We fir...
Link: Read Paper
Labels: code generation, program synthesis, code model, code model training, source code model

Nuances are the Key: Unlocking ChatGPT to Find Failure-Inducing Tests with Differential Prompting

Authors: Li, Tsz-On and Zong, Wenxi and Wang, Yibo and Tian, Haoye and Wang, Ying and Cheung, Shing-Chi and Kramer, Jeff
Abstract: Automated detection of software failures is an important but challenging software engineering task. It involves finding in a vast search space the failure-inducing test cases that contain an input triggering the software fault and an oracle asserting the incorrect execution. We are motivated to study how far this outstanding challenge can be solved by recent advances in large language models (LLMs) such as ChatGPT. However, our study reveals that ChatGPT has a relatively low success rate (28.8%)...
Link: Read Paper
Labels: program testing, differential testing

On the Evaluation of Neural Code Translation: Taxonomy and Benchmark

Authors: Jiao, Mingsheng and Yu, Tingrui and Li, Xuan and Qiu, Guanjie and Gu, Xiaodong and Shen, Beijun
Abstract: In recent years, neural code translation has gained increasing attention. While most of the research focuses on improving model architectures and training processes, we notice that the evaluation process and benchmark for code translation models are severely limited: they primarily treat source code as natural languages and provide a holistic accuracy score while disregarding the full spectrum of model capabilities across different translation types and complexity. In this paper, we present a co...
Link: Read Paper
Labels: code generation, code completion, empirical study

SCPatcher: Mining Crowd Security Discussions to Enrich Secure Coding Practices

Authors: Jiang, Ziyou and Shi, Lin and Yang, Guowei and Wang, Qing
Abstract: Secure coding practices (SCPs) have been proposed to guide software developers to write code securely to prevent potential security vulnerabilities. Yet, they are typically one-sentence principles without detailed specifications, e.g., “Properly free allocated memory upon the completion of functions and at all exit points.”, which makes them difficult to follow in practice, especially for software developers who are not yet experienced in secure programming. To address this problem, this paper p...
Link: Read Paper
Labels: software maintenance and deployment, code review

SMT Solver Validation Empowered by Large Pre-Trained Language Models

Authors: Sun, Maolin and Yang, Yibiao and Wang, Yang and Wen, Ming and Jia, Haoxiang and Zhou, Yuming
Abstract: SMT solvers are utilized to check the satisfiability of logic formulas and have been applied in various crucial domains, including software verification, test case generation, and program synthesis. However, bugs hidden in SMT solvers can lead to severe consequences, causing erroneous results in these domains. Therefore, ensuring the reliability and robustness of SMT solvers is of critical importance. Despite several testing approaches proposed for SMT solvers, generating effective test formulas...
Link: Read Paper
Labels: program testing, fuzzing

The Plastic Surgery Hypothesis in the Era of Large Language Models

Authors: Xia, Chunqiu Steven and Ding, Yifeng and Zhang, Lingming
Abstract: Automated Program Repair (APR) aspires to automatically generate patches for an input buggy program. Traditional APR tools typically focus on specific bug types and fixes through the use of templates, heuristics, and formal specifications. However, these techniques are limited in terms of the bug types and patch variety they can produce. As such, researchers have designed various learning-based APR tools with recent work focused on directly using Large Language Models (LLMs) for APR. While LLM-b...
Link: Read Paper
Labels: code generation, program repair

Twin Graph-Based Anomaly Detection via Attentive Multi-Modal Learning for Microservice System

Authors: Huang, Jun and Yang, Yang and Yu, Hang and Li, Jianguo and Zheng, Xiao
Abstract: Microservice architecture has sprung up over recent years for managing enterprise applications, due to its ability to independently deploy and scale services. Despite its benefits, ensuring the reliability and safety of a microservice system remains highly challenging. Existing anomaly detection algorithms based on a single data modality (i.e., metrics, logs, or traces) fail to fully account for the complex correlations and interactions between different modalities, leading to false negatives an...
Link: Read Paper
Labels: static analysis, bug detection

VALAR: Streamlining Alarm Ranking in Static Analysis with Value-Flow Assisted Active Learning

Authors: Liu, Pengcheng and Lu, Yifei and Yang, Wenhua and Pan, Minxue
Abstract: Static analyzers play a critical role in detecting program defects and security vulnerabilities. Despite their importance, the widespread adoption of static analysis techniques in industrial development faces numerous obstacles, among which the high rate of false alarms constitutes a significant one. To address this issue, we propose a novel approach called Valar, which performs alarm ranking for advanced value-flow analysis using the active learning technique. Active learning algorithms minimiz...
Link: Read Paper
Labels: static analysis, bug detection

What Makes Good In-Context Demonstrations for Code Intelligence Tasks with LLMs?

Authors: Gao, Shuzheng and Wen, Xin-Cheng and Gao, Cuiyun and Wang, Wenxuan and Zhang, Hongyu and Lyu, Michael R.
Abstract: Pre-trained models of source code have gained widespread popularity in many code intelligence tasks. Recently, with the scaling of the model and corpus size, large language models have shown the ability of in-context learning (ICL). ICL employs task instructions and a few examples as demonstrations, and then inputs the demonstrations to the language models for making predictions. This new learning paradigm is training-free and has shown impressive performance in various natural language processi...
Link: Read Paper
Labels: general coding task
