Analyzing and Mitigating (with LLMs) the Security Misconfigurations of Helm Charts from Artifact Hub (MSR 2024 - Registered Reports) - MSR 2024

Mon 15 - Tue 16 April 2024 Lisbon, Portugal

co-located with ICSE 2024

Who

Francesco Minna, Fabio Massacci, Katja Tuma

Track

MSR 2024 Registered Reports

Time Zone

The program is currently displayed in (GMT+01:00) Lisbon.

Use conference time zone: (GMT+01:00) LisbonSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

When

Tue 16 Apr 2024 14:48 - 14:53 at Grande Auditório - Security and Vision & Reflection Chair(s): Tim Menzies

Abstract

Background: Helm is a package manager that allows defining, installing, and upgrading applications with Kubernetes (K8s), a popular container orchestration platform. A Helm chart is a collection of files describing all dependencies, resources, and parameters required for deploying an application within a K8s cluster. Objective: We investigate the common (or unique) misconfigurations found by open-source chart analyzer tools in Helm chart repositories, and whether Large Language Models (LLMs) can recommend correct changes to such misconfigurations, even in the presence of maliciously injected misconfigurations. Method: We propose a pipeline to mine Helm charts from Artifact Hub, a popular centralized repository, and analyze them using state-of-the-art open-source tools, such as Checkov and KICS. First, such a pipeline will identify common and unique misconfigurations. Secondly, it will use LLMs to suggest mitigation for each misconfiguration, and the recommended refactoring will be analyzed again to see whether it satisfies the tools. Finally, it will inject into the charts potentially malicious mitigations (e.g., memory: john instead of memory: 250Mi, or over-privileged pods) and check whether they still satisfy the tool policies and whether LLMs can correctly refactor them. A final manual expert validation on a sub-sample will be used to provide Agresti-Coull-Wilson confidence intervals of the statistical results of the automated pipeline.

Francesco Minna

Vrije Universiteit Amsterdam

Netherlands

Fabio Massacci

University of Trento; Vrije Universiteit Amsterdam

Italy

Katja Tuma

Vrije Universiteit Amsterdam

Netherlands

Time Zone

The program is currently displayed in (GMT+01:00) Lisbon.

Use conference time zone: (GMT+01:00) LisbonSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Session Program

Tue 16 Apr
Displayed time zone: Lisbon change

	14:00 - 15:30	Security and Vision & ReflectionData and Tool Showcase Track / Technical Papers / Registered Reports / Vision and Reflection at Grande Auditório Chair(s): Tim Menzies North Carolina State University

	14:00 12m Talk		Quantifying Security Issues in Reusable JavaScript Actions in GitHub Workflows Technical Papers Hassan Onsori Delicheh University of Mons, Belgium, Alexandre Decan University of Mons; F.R.S.-FNRS, Tom Mens University of Mons Pre-print
	14:12 12m Talk		What Can Self-Admitted Technical Debt Tell Us About Security? A Mixed-Methods Study Technical Papers Nicolás E. Díaz Ferreyra Hamburg University of Technology, Mojtaba Shahin RMIT University, Mansooreh Zahedi The Univeristy of Melbourne, Sodiq Quadri Hamburg University of Technology, Riccardo Scandariato Hamburg University of Technology Pre-print
	14:24 12m Talk		Are Latent Vulnerabilities Hidden Gems for Software Vulnerability Prediction? An Empirical Study Technical Papers Triet Le The University of Adelaide, Xiaoning Du Monash University, Australia, Muhammad Ali Babar School of Computer Science, The University of Adelaide
	14:36 4m Talk		MalwareBench: Malware samples are not enough Data and Tool Showcase Track Nusrat Zahan North Carolina State University, Philipp Burckhardt Socket, Inc, Mikola Lysenko Socket, Inc, Feross Aboukhadijeh Socket, Inc, Laurie Williams North Carolina State University
	14:40 4m Talk		Hash4Patch: A Lightweight Low False Positive Tool for Finding Vulnerability Patch Commits Data and Tool Showcase Track Simone Scalco University of Trento, Ranindya Paramitha University of Trento
	14:44 4m Talk		MegaVul: A C/C++ Vulnerability Dataset with Comprehensive Code Representations Data and Tool Showcase Track Chao Ni School of Software Technology, Zhejiang University, Liyu Shen Zhejiang University, Xiaohu Yang Zhejiang University, Yan Zhu Zhejiang University, Shaohua Wang Central University of Finance and Economics Pre-print
	14:48 5m Talk		Analyzing and Mitigating (with LLMs) the Security Misconfigurations of Helm Charts from Artifact Hub Registered Reports Francesco Minna Vrije Universiteit Amsterdam, Fabio Massacci University of Trento; Vrije Universiteit Amsterdam, Katja Tuma Vrije Universiteit Amsterdam
	14:53 5m Talk		Fixing Smart Contract Vulnerabilities: A Comparative Analysis of Literature and Developer's Practices Registered Reports Francesco Salzano University of Molise, Simone Scalabrino University of Molise, Rocco Oliveto University of Molise, Remo Pareschi University of Molise
	15:00 30m Talk		Then, Now, and Next: Constants in Changing MSR Research Landscape Vision and Reflection Ayushi Rastogi University of Groningen, The Netherlands