Mon 15 Apr | Displayed time zone: Lisbon
09:00 - 10:30 | Day 1: Opening | Technical Papers / MSR Awards / Social Events / Tutorials / Data and Tool Showcase Track / Mining Challenge / Registered Reports / Industry Track / MIP Award / Vision and Reflection / Keynotes at Grande Auditório Chair(s): Diomidis Spinellis Athens University of Economics and Business & Delft University of Technology | ||
09:00 30mDay opening | Opening Session & Award Announcements MSR Awards | ||
09:30 30mAwards | MSR 2024 Foundational Contribution Award talk MSR Awards Margaret-Anne Storey University of Victoria | ||
10:00 30mTalk | Most Influential Paper Award talk MIP Award Eirini Kalliamvakou GitHub |
10:30 - 11:00 | Coffee for MSR newcomers | Social Events at Open Space (reserved area) Chair(s): Federica Sarro University College London, Alexander Serebrenik Eindhoven University of Technology | ||
10:30 30mCoffee break | Coffee for MSR newcomers Social Events |
11:00 - 12:30 | Ecosystems, Reuse and APIs & Tutorials | Data and Tool Showcase Track / Technical Papers / Tutorials at Almada Negreiros Chair(s): Mahmoud Alfadel University of Waterloo, Ayushi Rastogi University of Groningen, The Netherlands | ||
11:00 12mTalk | Thirty-Three Years of Mathematicians and Software Engineers: A Case Study of Domain Expertise and Participation in Proof Assistant Ecosystems Technical Papers Gwenyth Lincroft Northeastern University, Minsung Cho Northeastern University, Mahsa Bazzaz Northeastern University, Katherine Hough Northeastern University, Jonathan Bell Northeastern University Pre-print Media Attached | ||
11:12 12mTalk | Boosting API Misuse Detection via Integrating API Constraints from Multiple Sources Technical Papers Can Li Nanjing University of Aeronautics and Astronautics, Jingxuan Zhang Nanjing University of Aeronautics and Astronautics, Yixuan Tang Nanjing University of Aeronautics and Astronautics, Zhuhang Li Nanjing University of Aeronautics and Astronautics, Tianyue Sun Nanjing University of Aeronautics and Astronautics | ||
11:24 6mTalk | Availability and Usage of Platform-Specific APIs: A First Empirical Study Technical Papers Pre-print Media Attached File Attached | ||
11:30 4mTalk | AndroLibZoo: A Reliable Dataset of Libraries Based on Software Dependency Analysis Data and Tool Showcase Track Jordan Samhi CISPA Helmholtz Center for Information Security, Tegawendé F. Bissyandé University of Luxembourg, Jacques Klein University of Luxembourg | ||
11:34 4mTalk | Goblin: A Framework for Enriching and Querying the Maven Central Dependency Graph Data and Tool Showcase Track Damien Jaime Sorbonne Université - Lip6 - SAP, Joyce El Haddad Paris Dauphine-PSL Université, CNRS, LAMSADE, Pascal Poizat Université Paris Nanterre & LIP6 Pre-print File Attached | ||
11:38 4mTalk | Dataset: Copy-based Reuse in Open Source Software Data and Tool Showcase Track Mahmoud Jahanshahi Research Assistant, University of Tennessee Knoxville, Audris Mockus The University of Tennessee & Vilnius University Pre-print | ||
11:45 45mTalk | Mining Our Way Back to Incremental Builds for DevOps Pipelines Tutorials Shane McIntosh University of Waterloo Pre-print |
11:00 - 12:30 | Defects, Bugs and Issues | Technical Papers / MSR Awards / Social Events / Tutorials / Data and Tool Showcase Track / Mining Challenge / Registered Reports / Industry Track / MIP Award / Vision and Reflection / Keynotes at Grande Auditório Chair(s): Wesley Assunção North Carolina State University | ||
11:00 12mTalk | Enhancing Performance Bug Prediction Using Performance Code Metrics Technical Papers Guoliang Zhao Computer Science of Queen's University, Stefanos Georgio , Safwat Hassan University of Toronto, Canada, Ying Zou Queen's University, Kingston, Ontario, Derek Truong IBM Canada, Toby Corbin IBM UK | ||
11:12 12mTalk | CrashJS: A NodeJS Benchmark for Automated Crash Reproduction Technical Papers Philip Oliver Victoria University of Wellington, Jens Dietrich Victoria University of Wellington, Craig Anslow Victoria University of Wellington, Michael Homer Victoria University of Wellington | ||
11:24 12mTalk | An Empirical Study on Just-in-time Conformal Defect Prediction Technical Papers Xhulja Shahini paluno - University of Duisburg-Essen, Andreas Metzger University of Duisburg-Essen, Klaus Pohl | ||
11:36 12mTalk | Fine-Grained Just-In-Time Defect Prediction at the Block Level in Infrastructure-as-Code (IaC) Technical Papers Mahi Begoug , Moataz Chouchen ETS, Ali Ouni ETS Montreal, University of Quebec, Eman Abdullah AlOmar Stevens Institute of Technology, Mohamed Wiem Mkaouer University of Michigan - Flint | ||
11:48 4mTalk | TrickyBugs: A Dataset of Corner-case Bugs in Plausible Programs Data and Tool Showcase Track Kaibo Liu Peking University, Yudong Han Peking University, Yiyang Liu Peking University, Zhenpeng Chen Nanyang Technological University, Jie M. Zhang King's College London, Federica Sarro University College London, Gang Huang Peking University, Yun Ma Peking University | ||
11:52 4mTalk | GitBugs-Java: A Reproducible Java Benchmark of Recent Bugs Data and Tool Showcase Track André Silva KTH Royal Institute of Technology, Nuno Saavedra INESC-ID and IST, University of Lisbon, Martin Monperrus KTH Royal Institute of Technology | ||
11:56 4mTalk | A Dataset of Partial Program Fixes Data and Tool Showcase Track Dirk Beyer LMU Munich, Lars Grunske Humboldt-Universität zu Berlin, Matthias Kettl LMU Munich, Marian Lingsch-Rosenfeld LMU Munich, Moeketsi Raselimo Humboldt-Universität zu Berlin | ||
12:00 4mTalk | BugsPHP: A dataset for Automated Program Repair in PHP Data and Tool Showcase Track K.D. Pramod University of Moratuwa, Sri Lanka, W.T.N. De Silva University of Moratuwa, Sri Lanka, W.U.K. Thabrew University of Moratuwa, Sri Lanka, Ridwan Salihin Shariffdeen National University of Singapore, Sandareka Wickramanayake University of Moratuwa, Sri Lanka Pre-print | ||
12:04 4mTalk | AW4C: A Commit-Aware C Dataset for Actionable Warning Identification Data and Tool Showcase Track Zhipeng Liu , Meng Yan Chongqing University, Zhipeng Gao Shanghai Institute for Advanced Study - Zhejiang University, Dong Li , Xiaohong Zhang Chongqing University, Dan Yang Chongqing University | ||
12:08 5mTalk | Predicting the Impact of Crashes Across Release Channels Industry Track | ||
12:13 5mTalk | Zero Shot Learning based Alternatives for Class Imbalanced Learning Problem in Enterprise Software Defect Analysis Industry Track |
14:00 - 15:30 | Mining Challenge | Mining Challenge at Almada Negreiros Chair(s): Preetha Chatterjee Drexel University, USA, Fabio Palomba University of Salerno | ||
14:00 5mTalk | ChatGPT Chats Decoded: Uncovering Prompt Patterns for Superior Solutions in Software Development Lifecycle Mining Challenge Liangxuan Wu Huazhong University of Science and Technology, Yanjie Zhao Huazhong University of Science and Technology, Xinyi Hou Huazhong University of Science and Technology, Tianming Liu Monash University, Haoyu Wang Huazhong University of Science and Technology | ||
14:05 5mTalk | Write me this Code: An Analysis of ChatGPT Quality for Producing Source Code Mining Challenge Konstantinos Moratis Electrical and Computer Engineering Dept., Aristotle University of Thessaloniki, Themistoklis Diamantopoulos Electrical and Computer Engineering Dept, Aristotle University of Thessaloniki, Dimitrios-Nikitas Nastos Electrical and Computer Engineering Dept., Aristotle University of Thessaloniki, Andreas Symeonidis Aristotle University of Thessaloniki Pre-print | ||
14:10 5mTalk | Quality Assessment of ChatGPT Generated Code and their Use by Developers Mining Challenge Mohammed Latif Siddiq University of Notre Dame, Lindsay Roney University of Notre Dame, Jiahao Zhang , Joanna C. S. Santos University of Notre Dame Pre-print Media Attached File Attached | ||
14:15 5mTalk | Analyzing Developer Use of ChatGPT Generated Code in Open Source GitHub Projects Mining Challenge Balreet Grewal University of Alberta, Wentao Lu University of Alberta, Sarah Nadi New York University Abu Dhabi, University of Alberta, Cor-Paul Bezemer University of Alberta Pre-print | ||
14:20 5mTalk | How I Learned to Stop Worrying and Love ChatGPT Mining Challenge Piotr Przymus Nicolaus Copernicus University in Toruń, Poland, Mikołaj Fejzer Nicolaus Copernicus University in Toruń, Jakub Narębski Nicolaus Copernicus University in Toruń, Krzysztof Stencel University of Warsaw Pre-print | ||
14:25 5mTalk | Can ChatGPT Support Developers? An Empirical Evaluation of Large Language Models for Code Generation. Mining Challenge Kailun Jin York University, Chung-Yu Wang York University, Hung Viet Pham York University, Hadi Hemmati York University Pre-print | ||
14:30 5mTalk | The role of library versions in Developer-ChatGPT conversations Mining Challenge Pre-print | ||
14:35 5mTalk | AI Writes, We Analyze: The ChatGPT Python Code Saga Mining Challenge Md Fazle Rabbi Idaho State University, Arifa Islam Champa Idaho State University, Minhaz F. Zibran Idaho State University, Md Rakibul Islam Lamar University DOI Pre-print | ||
14:40 5mTalk | ChatGPT in Action: Analyzing Its Use in Software Development Mining Challenge Arifa Islam Champa Idaho State University, Md Fazle Rabbi Idaho State University, Costain Nachuma Idaho State University, Minhaz F. Zibran Idaho State University DOI Pre-print | ||
14:45 5mTalk | Chatting with AI: Deciphering Developer Conversations with ChatGPT Mining Challenge Suad Mohamed Belmont University, Abdullah Parvin Belmont University, Esteban Parra Belmont University | ||
14:50 5mTalk | Does Generative AI Generate Smells Related to Container Orchestration?: An Exploratory Study with Kubernetes Manifests Mining Challenge Yue Zhang Auburn University, Rachel Meredith Auburn University, Wilson Reaves Auburn University, Julia Coriolano Federal University of Pernambuco, Muhammad Ali Babar School of Computer Science, The University of Adelaide, Akond Rahman Auburn University Pre-print | ||
14:55 5mTalk | On the Taxonomy of Developers' Discussion Topics with ChatGPT Mining Challenge | ||
15:00 5mTalk | How to refactor this code? An exploratory study on developer-ChatGPT refactoring conversations Mining Challenge Eman Abdullah AlOmar Stevens Institute of Technology, AnushKrishna Venkatakrishnan Rochester Institute of Technology, USA, Mohamed Wiem Mkaouer University of Michigan - Flint, Christian Newman , Ali Ouni ETS Montreal, University of Quebec | ||
15:05 5mTalk | Analyzing Developer-ChatGPT Conversations for Software Refactoring: An Exploratory Study Mining Challenge Omkar Sandip Chavan Rochester Institute of Technology, Divya Dilip Hinge Rochester Institute of Technology, Soham Sanjay Deo Rochester Institute of Technology, Yaxuan (Olivia) Wang Rochester Institute of Technology, Mohamed Wiem Mkaouer University of Michigan - Flint | ||
15:10 5mTalk | How Do Software Developers Use ChatGPT? An Exploratory Study on GitHub Pull Requests Mining Challenge Moataz Chouchen ETS, Narjes Bessghaier ETS Montreal, University of Quebec, Mahi Begoug , Ali Ouni ETS Montreal, University of Quebec, Eman Abdullah AlOmar Stevens Institute of Technology, Mohamed Wiem Mkaouer University of Michigan - Flint | ||
15:15 5mTalk | Investigating the Utility of ChatGPT in the Issue Tracking System: An Exploratory Study Mining Challenge Joy Krishan Das University of Saskatchewan, Saikat Mondal University of Saskatchewan, Chanchal K. Roy University of Saskatchewan, Canada Pre-print | ||
15:20 5mTalk | Enhancing User Interaction in ChatGPT: Characterizing and Consolidating Multiple Prompts for Issue Resolution Mining Challenge Saikat Mondal University of Saskatchewan, Suborno Deb Bappon Department of Computer Science, University of Saskatchewan, Canada, Chanchal K. Roy University of Saskatchewan, Canada Pre-print |
14:00 - 15:30 | Software Quality | Technical Papers / Registered Reports / Data and Tool Showcase Track at Grande Auditório Chair(s): Gopi Krishnan Rajbahadur Centre for Software Excellence, Huawei, Canada | ||
14:00 12mTalk | Not all Dockerfile Smells are the Same: An Empirical Evaluation of Hadolint Writing Practices by Experts Technical Papers Giovanni Rosa University of Molise, Simone Scalabrino University of Molise, Gregorio Robles Universidad Rey Juan Carlos, Rocco Oliveto University of Molise | ||
14:12 12mTalk | Supporting High-Level to Low-Level Requirements Coverage Reviewing with Large Language Models Technical Papers Anamaria-Roberta Hartl Johannes Kepler University Linz, Christoph Mayr-Dorn Johannes Kepler University Linz, Atif Mashkoor Johannes Kepler University Linz, Alexander Egyed Johannes Kepler University Linz DOI Authorizer link Pre-print | ||
14:24 12mTalk | On the Executability of R Markdown Files Technical Papers Md Anaytul Islam Lakehead University, Muhammad Asaduzzman University of Windsor, Shaowei Wang Department of Computer Science, University of Manitoba, Canada | ||
14:36 12mTalk | APIstic: A Large Collection of OpenAPI Metrics Technical Papers Souhaila Serbout Software Institute @ USI, Cesare Pautasso Software Institute, Faculty of Informatics, USI Lugano | ||
14:48 6mTalk | Improving Automated Code Reviews: Learning From Experience Technical Papers Hong Yi Lin The University of Melbourne, Patanamon Thongtanunam University of Melbourne, Christoph Treude Singapore Management University, Wachiraphan (Ping) Charoenwet The University of Melbourne | ||
14:55 4mTalk | Multi-faceted Code Smell Detection at Scale using DesigniteJava 2.0 Data and Tool Showcase Track Tushar Sharma Dalhousie University Pre-print | ||
14:59 4mTalk | SATDAUG - A Balanced and Augmented Dataset for Detecting Self-Admitted Technical Debt Data and Tool Showcase Track Edi Sutoyo Bernoulli Institute for Mathematics, Computer Science and Artificial Intelligence, University of Groningen, Andrea Capiluppi University of Groningen | ||
15:03 4mTalk | Curated Email-Based Code Reviews Datasets Data and Tool Showcase Track Mingzhao Liang The University of Melbourne, Wachiraphan (Ping) Charoenwet The University of Melbourne, Patanamon Thongtanunam University of Melbourne | ||
15:07 4mTalk | TestDossier: A Dataset of Tested Values Automatically Extracted from Test Execution Data and Tool Showcase Track Andre Hora UFMG Pre-print Media Attached | ||
15:11 4mTalk | Greenlight: Highlighting TensorFlow APIs Energy Footprint Data and Tool Showcase Track Saurabhsingh Rajput Dalhousie University, Maria Kechagia University College London, Federica Sarro University College London, Tushar Sharma Dalhousie University Pre-print | ||
15:15 5mTalk | When Code Smells Meet ML: On the Lifecycle of ML-specific Code Smells in ML-enabled Systems Registered Reports Gilberto Recupito University of Salerno, Giammaria Giordano University of Salerno, Filomena Ferrucci University of Salerno, Dario Di Nucci University of Salerno, Fabio Palomba University of Salerno | ||
15:20 5mTalk | Comparison of Static Analysis Architecture Recovery Tools for Microservice Applications Registered Reports Simon Schneider Hamburg University of Technology, Alexander Bakhtin University of Oulu, Xiaozhou Li University of Oulu, Jacopo Soldani University of Pisa, Italy, Antonio Brogi Università di Pisa, Tomas Cerny University of Arizona, Riccardo Scandariato Hamburg University of Technology, Davide Taibi University of Oulu and Tampere University |
16:00 - 17:30 | Mobile Apps | Data and Tool Showcase Track / Technical Papers at Almada Negreiros Chair(s): Dario Di Nucci University of Salerno | ||
16:00 12mTalk | Automating GUI-based Test Oracles for Mobile Apps Technical Papers Kesina Baral CQSE America, Jack Johnson , Junayed Mahmud George Mason University, Sabiha Salma George Mason University, Mattia Fazzini University of Minnesota, Julia Rubin University of British Columbia, Jeff Offutt George Mason University, Kevin Moran University of Central Florida | ||
16:12 12mTalk | Global Prosperity or Local Monopoly? Understanding the Geography of App Popularity Technical Papers Liu Wang Beijing University of Posts and Telecommunications, Conghui Zheng Beijing University of Posts and Telecommunications, Haoyu Wang Huazhong University of Science and Technology, Xiapu Luo The Hong Kong Polytechnic University, Gareth Tyson Queen Mary University of London, Yi Wang , Shangguang Wang Beijing University of Posts and Telecommunications | ||
16:24 12mTalk | GuiEvo: Automated Evolution of Mobile App UIs Technical Papers Sabiha Salma George Mason University, S M Hasan Mansur George Mason University, Yule Zhang George Mason University, Kevin Moran University of Central Florida | ||
16:36 12mTalk | Comparing Apples to Androids: Discovery, Retrieval, and Matching of iOS and Android Apps for Cross-Platform Analyses Technical Papers Magdalena Steinböck TU Wien, Jakob Bleier TU Wien, Mikka Rainer CISPA Helmholtz Center for Information Security, Tobias Urban Institute for Internet Security & secunet Security Networks AG, Christine Utz CISPA Helmholtz Center for Information Security, Martina Lindorfer TU Wien | ||
16:48 12mTalk | Keep Me Updated: An Empirical Study on Embedded Javascript Engines in Android Apps Technical Papers Elliott Wen The University of Auckland, Jiaxiang Liu The Hong Kong Polytechnic University, Xiapu Luo The Hong Kong Polytechnic University, Giovanni Russello University of Auckland, Jens Dietrich Victoria University of Wellington | ||
17:00 12mTalk | Large Language Model vs. Stack Overflow in Addressing Android Permission Related Challenges Technical Papers Sahrima Jannat Oishwee University of Saskatchewan, Natalia Stakhanova University of Saskatchewan, Zadia Codabux University of Saskatchewan, Canada | ||
17:12 4mTalk | DATAR: A Dataset for Tracking App Releases Data and Tool Showcase Track Yasaman Abedini Sharif University of Technology, Mohammad Hadi Hajihosseini Sharif University of Technology, Abbas Heydarnoori Bowling Green State University | ||
17:16 4mTalk | AndroZoo: A Retrospective with a Glimpse into the Future Data and Tool Showcase Track Marco Alecci University of Luxembourg, Pedro Jesús Ruiz Jiménez University of Luxembourg, Kevin Allix Independent Researcher, Tegawendé F. Bissyandé University of Luxembourg, Jacques Klein University of Luxembourg |
16:00 - 17:30 | Machine learning for Software Engineering | Technical Papers at Grande Auditório Chair(s): Diego Costa Concordia University, Canada | ||
16:00 12mTalk | Whodunit: Classifying Code as Human Authored or GPT-4 Generated - A case study on CodeChef problems Technical Papers Oseremen Joy Idialu University of Waterloo, Noble Saji Mathews University of Waterloo, Canada, Rungroj Maipradit University of Waterloo, Joanne M. Atlee University of Waterloo, Mei Nagappan University of Waterloo DOI Pre-print | ||
16:12 12mTalk | GIRT-Model: Automated Generation of Issue Report Templates Technical Papers Nafiseh Nikehgbal Sharif University of Technology, Amir Hossein Kargaran LMU Munich, Abbas Heydarnoori Bowling Green State University DOI Pre-print | ||
16:24 12mTalk | MicroRec: Leveraging Large Language Models for Microservice Recommendation Technical Papers Ahmed Saeed Alsayed University of Wollongong, Hoa Khanh Dam University of Wollongong, Chau Nguyen University of Wollongong | ||
16:36 12mTalk | PeaTMOSS: A Dataset and Initial Analysis of Pre-Trained Models in Open-Source Software Technical Papers Wenxin Jiang Purdue University, Jerin Yasmin Queen's University, Canada, Jason Jones Purdue University, Nicholas Synovic Loyola University Chicago, Jiashen Kuo Purdue University, Nathaniel Bielanski Purdue University, Yuan Tian Queen's University, Kingston, Ontario, George K. Thiruvathukal Loyola University Chicago and Argonne National Laboratory, James C. Davis Purdue University DOI Pre-print | ||
16:48 12mTalk | Data Augmentation for Supervised Code Translation Learning Technical Papers Binger Chen Technische Universität Berlin, Jacek Golebiowski Amazon AWS, Ziawasch Abedjan Leibniz Universität Hannover | ||
17:00 12mTalk | On the Effectiveness of Machine Learning-based Call-Graph Pruning: An Empirical Study Technical Papers Amir Mir Delft University of Technology, Mehdi Keshani Delft University of Technology, Sebastian Proksch Delft University of Technology Pre-print | ||
17:12 12mTalk | Leveraging GPT-like LLMs to Automate Issue Labeling Technical Papers Giuseppe Colavito University of Bari, Italy, Filippo Lanubile University of Bari, Nicole Novielli University of Bari, Luigi Quaranta University of Bari, Italy Pre-print |
Tue 16 Apr | Displayed time zone: Lisbon
09:00 - 10:30 | Development: practices and humans | Data and Tool Showcase Track / Technical Papers at Almada Negreiros Chair(s): Gema Rodríguez-Pérez University of British Columbia (UBC) | ||
09:50 6mTalk | Exploring the Effect of Multiple Natural Languages on Code Suggestion Using GitHub Copilot Technical Papers Kei Koyanagi Kyushu University, Dong Wang Kyushu University, Japan, Kotaro Noguchi Kyushu University, Masanari Kondo Kyushu University, Alexander Serebrenik Eindhoven University of Technology, Yasutaka Kamei Kyushu University, Naoyasu Ubayashi Kyushu University Pre-print | ||
09:56 4mTalk | A Four-Dimension Gold Standard Dataset for Opinion Mining in Software Engineering Data and Tool Showcase Track Md Rakibul Islam Lamar University, Md Fazle Rabbi Idaho State University, Jo Youngeun Lamar University, Arifa Islam Champa Idaho State University, Ethan J Young Lamar University, Camden M Wilson Lamar University, Gavin J Scott Lamar University, Minhaz F. Zibran Idaho State University | ||
10:00 4mTalk | Opening the Valve on Pure-Data: Usage Patterns and Programming Practices of a Data-Flow Based Visual Programming Language Data and Tool Showcase Track Anisha Islam Department of Computing Science, University of Alberta, Kalvin Eng University of Alberta, Abram Hindle University of Alberta | ||
10:04 4mTalk | The PIPr Dataset of Public Infrastructure as Code Programs Data and Tool Showcase Track Daniel Sokolowski University of St. Gallen, David Spielmann University of St. Gallen, Guido Salvaneschi University of St. Gallen Link to publication DOI Pre-print | ||
10:08 4mTalk | A Dataset of Microservices-based Open-Source Projects Data and Tool Showcase Track Dario Amoroso d'Aragona Tampere University, Alexander Bakhtin University of Oulu, Xiaozhou Li University of Oulu, Ruoyu Su University of Oulu, Lauren Adams Baylor University, Ernesto Aponte Universidad del Sagrado Corazón, Francis Boyle Baylor University, Patrick Boyle Baylor University, Rachel Koerner Baylor University, Joseph Lee University of Richmond, Fangchao Tian University of Oulu, Yuqing Wang University of Oulu, Jesse Nyyssölä University of Helsinki, Ernesto Quevedo Baylor University, Shahidur Md Rahaman Baylor University, Amr Elsayed Baylor University, Mika Mäntylä University of Helsinki and University of Oulu, Tomas Cerny University of Arizona, Davide Taibi University of Oulu and Tampere University | ||
10:12 4mTalk | SensoDat: Simulation-based Sensor Dataset of Self-driving Cars Data and Tool Showcase Track Christian Birchler Zurich University of Applied Sciences & University of Bern, Cyrill Rohrbach University of Bern, Switzerland, Timo Kehrer University of Bern, Sebastiano Panichella Zurich University of Applied Sciences | ||
10:16 4mTalk | Incivility in Open Source Projects: A Comprehensive Annotated Dataset of Locked GitHub Issue Threads Data and Tool Showcase Track Ramtin Ehsani Drexel University, Mia Mohammad Imran Virginia Commonwealth University, Robert Zita Elmhurst University, Kostadin Damevski Virginia Commonwealth University, Preetha Chatterjee Drexel University, USA | ||
10:20 4mTalk | A Dataset of Atoms of Confusion in the Android Open Source Project Data and Tool Showcase Track Davi Batista Tabosa Federal University of Ceará, Oton Pinheiro Federal University of Ceará, Lincoln Rocha Federal University of Ceará, Windson Viana Federal University of Ceará | ||
10:24 4mTalk | PlayMyData: a curated dataset of multi-platform video games Data and Tool Showcase Track Andrea D'Angelo University of L'Aquila, Claudio Di Sipio University of L'Aquila, Cristiano Politowski DIRO, University of Montreal, Riccardo Rubei University of L'Aquila |
09:00 - 10:30 | |||
09:00 45mKeynote | Questioning the questions we ask about the impact of AI on software engineering Keynotes Margaret-Anne Storey University of Victoria | ||
09:45 45mTalk | Open Source Software Digital Sociology: Quantifying and Managing Complex Open Source Software Ecosystem Tutorials Minghui Zhou Peking University, Yuxia Zhang Beijing Institute of Technology, Xin Tan Beihang University |
11:00 - 12:30 | Process automation & DevOps and Tutorial I | Technical Papers / Tutorials at Almada Negreiros Chair(s): Tom Mens University of Mons, Ayushi Rastogi University of Groningen, The Netherlands | ||
11:00 12mTalk | Learning to Predict and Improve Build Successes in Package Ecosystems Technical Papers Harshitha Menon Lawrence Livermore National Lab, Daniel Nichols University of Maryland, College Park, Abhinav Bhatele University of Maryland, College Park, Todd Gamblin Lawrence Livermore National Laboratory | ||
11:12 12mTalk | The Impact of Code Ownership of DevOps Artefacts on the Outcome of DevOps CI Builds Technical Papers Ajiromola Kola-Olawuyi University of Waterloo, Nimmi Rashinika Weeraddana University of Waterloo, Mei Nagappan University of Waterloo | ||
11:24 12mTalk | A Mutation-Guided Assessment of Acceleration Approaches for Continuous Integration: An Empirical Study of YourBase Technical Papers Zhili Zeng University of Waterloo, Tao Xiao Nara Institute of Science and Technology, Maxime Lamothe Polytechnique Montreal, Hideaki Hata Shinshu University, Shane McIntosh University of Waterloo Pre-print | ||
11:45 45mTalk | Cohort Studies for Mining Software Repositories Tutorials Nyyti Saarimäki Tampere University, Sira Vegas Universidad Politecnica de Madrid, Valentina Lenarduzzi University of Oulu, Davide Taibi University of Oulu and Tampere University, Mikel Robredo University of Oulu
11:00 - 12:30 | Software Evolution & Analysis | Technical Papers / Data and Tool Showcase Track / Industry Track at Grande Auditório Chair(s): Vladimir Kovalenko JetBrains Research | ||
11:00 12mTalk | Unveiling ChatGPT's Usage in Open Source Projects: A Mining-based Study Technical Papers Rosalia Tufano Università della Svizzera Italiana, Antonio Mastropaolo Università della Svizzera italiana, Federica Pepe University of Sannio, Ozren Dabic Software Institute, Università della Svizzera italiana (USI), Switzerland, Massimiliano Di Penta University of Sannio, Italy, Gabriele Bavota Software Institute @ Università della Svizzera Italiana | ||
11:12 12mTalk | DRMiner: A Tool For Identifying And Analyzing Refactorings In Dockerfile Technical Papers Emna Ksontini University of Michigan - Dearborn, Aycha Abid Oakland University, Rania Khalsi University of Michigan - Flint, Marouane Kessentini University of Michigan - Flint | ||
11:24 12mTalk | A Large-Scale Empirical Study of Open Source License Usage: Practices and Challenges Technical Papers Jiaqi Wu Zhejiang University, Lingfeng Bao Zhejiang University, Xiaohu Yang Zhejiang University, Xin Xia Huawei Technologies, Xing Hu Zhejiang University | ||
11:36 12mTalk | Analyzing the Evolution and Maintenance of ML Models on Hugging Face Technical Papers Joel Castaño Fernández Universitat Politècnica de Catalunya, Silverio Martínez-Fernández UPC-BarcelonaTech, Xavier Franch Universitat Politècnica de Catalunya, Justus Bogner Vrije Universiteit Amsterdam Link to publication Pre-print | ||
11:48 12mTalk | On the Anatomy of Real-World R Code for Static Analysis Technical Papers Florian Sihler Ulm University, Lukas Pietzschmann Ulm University, Raphael Straub Ulm University, Matthias Tichy Ulm University, Germany, Andor Diera Ulm University, Abdelhalim Dahou GESIS Leibniz Institute for the Social Sciences Pre-print File Attached | ||
12:00 6mTalk | Encoding Version History Context for Better Code Representation Technical Papers Huy Nguyen The University of Melbourne, Christoph Treude Singapore Management University, Patanamon Thongtanunam University of Melbourne Pre-print | ||
12:06 4mTalk | CodeLL: A Lifelong Learning Dataset to Support the Co-Evolution of Data and Language Models of Code Data and Tool Showcase Track Martin Weyssow DIRO, Université de Montréal, Claudio Di Sipio University of L'Aquila, Davide Di Ruscio University of L'Aquila, Houari Sahraoui DIRO, Université de Montréal | ||
12:10 4mTalk | Bidirectional Paper-Repository Tracing in Software Engineering Data and Tool Showcase Track Daniel Garijo , Miguel Arroyo Universidad Politécnica de Madrid, Esteban González Guardia Universidad Politécnica de Madrid, Christoph Treude Singapore Management University, Nicola Tarocco CERN | ||
12:14 4mTalk | DistilKaggle: A Distilled Dataset of Kaggle Jupyter Notebooks Data and Tool Showcase Track Mojtaba Mostafavi Department of Computer Engineering of Sharif University of Technology, Arash Asgari Department of Computer Engineering of Sharif University of Technology, Mohammad Abolnejadian Department of Computer Engineering of Sharif University of Technology, Abbas Heydarnoori Bowling Green State University | ||
12:18 5mTalk | Estimating Usage of Open Source Projects Industry Track |
14:00 - 15:30 | Process automation & DevOps II | Technical Papers / Data and Tool Showcase Track at Almada Negreiros Chair(s): Shane McIntosh University of Waterloo | ||
14:00 12mTalk | Options Matter: Documenting and Fixing Non-Reproducible Builds in Highly-Configurable Systems Technical Papers Georges Aaron RANDRIANAINA Université de Rennes 1, IRISA, Djamel Eddine Khelladi CNRS, IRISA, University of Rennes, Olivier Zendra Inria, Mathieu Acher University of Rennes, France / Inria, France / CNRS, France / IRISA, France | ||
14:12 12mTalk | How do Machine Learning Projects use Continuous Integration Practices? An Empirical Study on GitHub Actions Technical Papers João Helis Bernardo Federal Institute of Education, Science and Technology of Rio Grande do Norte, Daniel Alencar Da Costa University of Otago, Sergio Queiroz de Medeiros Universidade Federal do Rio Grande do Norte, Uirá Kulesza Federal University of Rio Grande do Norte DOI Pre-print | ||
14:24 4mTalk | A dataset of GitHub Actions workflow histories Data and Tool Showcase Track Guillaume Cardoen University of Mons, Tom Mens University of Mons, Alexandre Decan University of Mons; F.R.S.-FNRS | ||
14:28 4mTalk | gawd: A Differencing Tool for GitHub Actions Workflows Data and Tool Showcase Track Pooya Rostami Mazrae University of Mons, Alexandre Decan University of Mons; F.R.S.-FNRS, Tom Mens University of Mons | ||
14:32 4mTalk | RABBIT: A tool for identifying bot accounts based on their recent GitHub event history Data and Tool Showcase Track Natarajan Chidambaram University of Mons, Tom Mens University of Mons, Alexandre Decan University of Mons; F.R.S.-FNRS | ||
14:36 12mTalk | An Investigation of Patch Porting Practices of the Linux Kernel Ecosystem Technical Papers Xingyu Li UC Riverside, Zheng Zhang UC Riverside, Zhiyun Qian University of California at Riverside, USA, Trent Jaeger UC Riverside, Chengyu Song University of California at Riverside, USA | ||
14:48 4mTalk | BugsPHP: A dataset for Automated Program Repair in PHP Data and Tool Showcase Track K.D. Pramod University of Moratuwa, Sri Lanka, W.T.N. De Silva University of Moratuwa, Sri Lanka, W.U.K. Thabrew University of Moratuwa, Sri Lanka, Ridwan Salihin Shariffdeen National University of Singapore, Sandareka Wickramanayake University of Moratuwa, Sri Lanka Pre-print |
16:00 - 17:30 | Day 2: Closing | MSR Awards / Vision and Reflection at Grande Auditório Chair(s): Alberto Bacchelli University of Zurich | ||
16:00 30mTalk | MSR in the age of LLMs Vision and Reflection Christoph Treude Singapore Management University | ||
16:30 30mTalk | Idealists and Pragmatists—An Only Somewhat Self-Indulgent Reflection on the Development of an MSR Paper (and Researcher) Vision and Reflection Shane McIntosh University of Waterloo | ||
17:00 30mDay closing | Closing session MSR Awards Diomidis Spinellis Athens University of Economics and Business & Delft University of Technology, Olga Baysal |
Unscheduled Events
Not scheduled Talk | Empirical Studies of Parameter Efficient Methods for Large Language Models of Code and Knowledge Transfer to R Registered Reports Amirreza Esmaeili University of British Columbia, Iman Saberi University of British Columbia Okanagan, Fatemeh Hendijani Fard University of British Columbia | ||
Not scheduled Talk | Assessing the Influence of Toxic and Gender Discriminatory Communication on Perceptible Diversity in OSS Projects Registered Reports Sayma Sultana Wayne State University, Gias Uddin York University, Canada, Amiangshu Bosu Wayne State University Pre-print |
Call for Registrations
The Empirical Software Engineering Journal (EMSE), in conjunction with the conference on Mining Software Repositories (MSR), is continuing the Registered Reports (RR) track. The RR track of MSR 2024 has two goals: (1) to prevent HARKing (hypothesizing after the results are known) for empirical studies; (2) to provide early feedback to authors on their initial study design. For papers submitted to the RR track, methods and proposed analyses are reviewed prior to execution. Pre-registered studies follow a two-step process:
- Stage 1: A report is submitted that describes the planned study. The submitted report is evaluated by the reviewers of the RR track of MSR 2024. Authors of accepted pre-registered studies will be given the opportunity to present their work at MSR.
- Stage 2: Once a report has passed Stage 1, the study is conducted and the actual data collection and analysis take place. The results may also be negative! The full paper is submitted for review to EMSE.
Paper Types, Evaluation Criteria, and Acceptance Types
The RR track of MSR 2024 supports two types of papers:
Confirmatory : The researcher has a fixed hypothesis (or several fixed hypotheses) and the objective of the study is to find out whether the hypothesis is supported by the facts/data.
An example of a completed confirmatory study:
- Inozemtseva, L., & Holmes, R. (2014, May). Coverage is not strongly correlated with test suite effectiveness. In Proceedings of the 36th international conference on software engineering (pp. 435-445).
Exploratory : The researcher does not have a hypothesis (or has one that may change during the study). Often, the objective of such a study is to understand what is observed and answer questions such as WHY, HOW, WHAT, WHO, or WHEN. We include in this category registrations for which the researcher has an initial proposed solution for an automated approach (e.g., a new deep-learning-based defect prediction approach) that serves as a starting point for their exploration to reach an effective solution.
Examples of completed exploratory studies:
- Gousios, G., Pinzger, M., & Deursen, A. V. (2014, May). An exploratory study of the pull-based software development model. In Proceedings of the 36th International Conference on Software Engineering (pp. 345-355).
- Rodrigues, I. M., Aloise, D., Fernandes, E. R., & Dagenais, M. (2020, June). A Soft Alignment Model for Bug Deduplication. In Proceedings of the 17th International Conference on Mining Software Repositories (pp. 43-53).
The reviewers will evaluate RR track submissions based on the following criteria:
- The importance of the research question(s).
- The logic, rationale, and plausibility of the proposed hypotheses.
- The soundness and feasibility of the methodology and analysis pipeline (including statistical power analysis where appropriate).
- (For confirmatory studies) Whether the clarity and degree of methodological detail are sufficient to exactly replicate the proposed experimental procedures and analysis pipeline.
- (For confirmatory studies) Whether the authors have pre-specified sufficient outcome-neutral tests for ensuring that the results obtained can test the stated hypotheses, including positive controls and quality checks.
- (For exploratory studies, if applicable) The description of the data set that is the basis for exploration.
The outcome of the RR report review is one of the following:
- In-Principle Acceptance (IPA): The reviewers agree that the study is relevant, the outcome of the study (whether confirmation / rejection of hypothesis) is of interest to the community, the protocol for data collection is sound, and that the analysis methods are adequate. The authors can engage in the actual study for Stage 2. If the protocol is adhered to (or deviations are thoroughly justified), the study is published. Of course, this being a journal submission, a revision of the submitted manuscript may be necessary. Reviewers will especially evaluate how precisely the protocol of the accepted pre-registered report is followed, or whether deviations are justified.
- Continuity Acceptance (CA): The reviewers agree that the study is relevant and that the (initial) methods appear to be appropriate. However, for exploratory studies, implementation details and post-experiment analyses or discussion (e.g., why the proposed automated approach does not work) may require follow-up checks. We’ll try our best to get the original reviewers. All PC members will be invited on the condition that they agree to review papers in both Stage 1 and Stage 2. Four (4) PC members will review the Stage 1 submission, and three (3) will review the Stage 2 submission.
- Rejection: The reviewers do not agree on the relevance of the study or are not convinced that the study design is sufficiently mature. Comments are provided to the authors to improve the study design before starting it.
Note : For MSR 2024, only confirmatory studies are granted an IPA. Exploratory studies in software engineering often cannot be adequately assessed until after the study has been completed and the findings are elaborated and discussed in a full paper. For example, consider a study in an RR proposing defect prediction using a new deep learning architecture. This work falls under the exploratory category. It is difficult to offer an IPA, as we do not know whether the approach is any better than a traditional one based on, e.g., decision trees. Negative results are welcome; however, a negative-results paper should go beyond presenting “we tried and failed” and provide interesting insights to readers, e.g., why the results are negative or what that means for further studies on this topic (following the criteria of REplication and Negative Results (RENE) tracks, e.g., https://saner2023.must.edu.mo/negativerestrack). Furthermore, authors are required to document all deviations (if any) in a section of the paper.
Submission Process and Instructions
The timeline for MSR 2024 RR track will be as follows:
Dec 12, 2023 : Authors submit their initial report. Submissions must not exceed 6 pages (plus 1 additional page of references). The page limit is strict.
All authors should use the official “ACM Primary Article Template”, as can be obtained from the ACM Proceedings Template page. LaTeX users should use the sigconf option, as well as the review option (to produce line numbers for easy reference by the reviewers). To that end, the following LaTeX code can be placed at the start of the LaTeX document: \documentclass[sigconf,review]{acmart}
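As a rough illustration of where that document class line sits in a report, here is a minimal skeleton. It is only a sketch: the title, author, affiliation, email, and body text are placeholders, not values prescribed by the track, and the official template remains the authoritative reference.

```latex
% Minimal sketch of a report skeleton; every title, name, and text below is a placeholder.
\documentclass[sigconf,review]{acmart} % sigconf: conference layout; review: line numbers

\begin{document}

\title{Placeholder Report Title}

\author{Placeholder Author}
\affiliation{%
  \institution{Placeholder University}
  \city{Placeholder City}
  \country{Placeholder Country}}
\email{author@example.org}

% In acmart, the abstract is declared before \maketitle.
\begin{abstract}
Placeholder abstract describing the planned study.
\end{abstract}

\maketitle

\section{Introduction}
Placeholder text.

\end{document}
```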
Feb 2, 2024 : Authors receive PC members’ reviews.
Feb 16, 2024 : Authors submit a response letter + revised report in a single PDF.
- The response letter should address reviewer comments and questions.
- The response letter + revised report must not exceed 12 pages (plus 1 additional page of references).
- The response letter does not need to follow ACM formatting instructions.
Mar 8, 2024 : Notification of Stage 1
- (Outcome: in-principle acceptance, continuity acceptance, or rejection).
Mar 15, 2024 : Authors submit their accepted RR report to arXiv
- To be checked by PC members for Stage 2
- Note: Due to the timeline, RR reports will not be published in the MSR 2024 proceedings.
Before Dec 8, 2024 : Authors submit a full paper to EMSE. Instructions will be provided later. However, the following constraints will be enforced:
- Any change of authors must be justified. If authors are added or removed, or the author order is changed, between the original Stage 1 submission and the EMSE submission, all authors will need to complete and sign a “Change of authorship request form”. The Editors in Chief of EMSE and the chairs of the RR track reserve the right to deny author changes. If you anticipate any authorship changes, please reach out to the chairs of the RR track as early as possible.
- PC members who reviewed an RR report in Stage 1 and their directly supervised students cannot be added as authors of the corresponding submission in Stage 2.
Submissions can be made via the submission site (https://msr2024-registered-report.hotcrp.com/) by the submission deadline. Any submission that does not comply with the aforementioned instructions and the mandatory information specified in the Author Guide is likely to be desk rejected. In addition, by submitting, the authors acknowledge that they are aware of and agree to be bound by the following policies:
- The ACM Policy and Procedures on Plagiarism and the IEEE Plagiarism FAQ. In particular, papers submitted to MSR 2024 must not have been published elsewhere and must not be under review or submitted for review elsewhere whilst under consideration for MSR 2024. Contravention of this concurrent submission policy will be deemed a serious breach of scientific ethics, and appropriate action will be taken in all such cases (including immediate rejection and reporting of the incident to ACM/IEEE). To check for double submission and plagiarism issues, the chairs reserve the right to (1) share the list of submissions with the PC Chairs of other conferences with overlapping review periods and (2) use external plagiarism detection software, under contract to the ACM or IEEE, to detect violations of these policies.
- The authorship policy of the ACM and the authorship policy of the IEEE.