Skip to content
JYSun edited this page May 23, 2024 · 66 revisions

Paper Reading Group

The paper reading group meets weekly during the semester to discuss papers. Participation is open to all, guests are always welcome; if you are interested in receiving invitations contact the organizer.

Each week we will discuss a different paper. The paper to discuss is announced about one week in advance by the organizer. All participants are expected to read the paper before the meeting. It is recommended to take notes about insights, questions, and other points potentially worth discussing.

The goals of the reading group are:

  • Critical reflection on scientific work
  • Practice of reading and argumentation strategies
  • Exposure to a broad range of research topics
  • Practice of leading group discussions

The discussion is limited to one hour. The discussion is led by a moderator, who may also set a focus for the discussion. The moderator will kick off the meeting by giving a short summary of the paper and raising a few points for discussion. The moderator should try to incorporate all participants into the discussion. The moderator role rotates through all participants. The moderator is encouraged to help with the selection of a paper that week.

Here is one blogpost about How to Read Academic Papers.

Agenda

May 21, 2024

Qi Guo, Junming Cao, Xiaofei Xie, Shangqing Liu, Xiaohong Li, Bihuan Chen, and Xin Peng. 2024. Exploring the Potential of ChatGPT in Automated Code Refinement: An Empirical Study. In Proceedings of the IEEE/ACM 46th International Conference on Software Engineering (ICSE '24). Association for Computing Machinery, New York, NY, USA, Article 34, 1–13. Moderator: Rohith Pudari

May 14, 2024

Pertseva, Elizaveta, et al. "A Theory of Scientific Programming Efficacy." Proceedings of the IEEE/ACM 46th International Conference on Software Engineering. 2024. Moderator: Jiessie Tie

April 1, 2024

Fang, Hongbo, James Herbsleb, and Bogdan Vasilescu. "Matching Skills, Past Collaboration, and Limited Competition: Modeling When Open-Source Projects Attract Contributors." Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 2023. Moderator: Jiayi Sun

March 25, 2024

Wang, Zijie J., Katie Dai, and W. Keith Edwards. “Stickyland: Breaking the linear presentation of computational notebooks.” CHI Conference on Human Factors in Computing Systems Extended Abstracts. 2022.Moderator: Sepehr Hosseini Khorasgani

March 11, 2024

Choudhuri, R., Liu, D., Steinmacher, I., Gerosa, M., & Sarma, A. (2023). How Far Are We? The Triumphs and Trials of Generative AI in Learning Software Engineering. arXiv preprint arXiv:2312.11719. Accepted for ICSE2024 Moderator: Shurui Zhou

March 4, 2024

Hancheng Cao, Chia-Jung Lee, Shamsi Iqbal, Mary Czerwinski, Priscilla N Y Wong, Sean Rintel, Brent Hecht, Jaime Teevan, and Longqi Yang. 2021. Large Scale Analysis of Multitasking Behavior During Remote Meetings. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems (CHI '21). Association for Computing Machinery, New York, NY, USA, Article 448, 1–13. Moderator: Kathy Cheng

Feb 26, 2024

J.D. Zamfirescu-Pereira, Richmond Y. Wong, Bjoern Hartmann, and Qian Yang. 2023. Why Johnny Can’t Prompt: How Non-AI Experts Try (and Fail) to Design LLM Prompts. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (CHI '23). Association for Computing Machinery, New York, NY, USA, Article 437, 1–21. Moderator: Jiessie

Feb 12, 2024

James Ivers, Robert L. Nord, Ipek Ozkaya, Chris Seifried, Christopher S. Timperley, and Marouane Kessentini. 2022. Industry experiences with large-scale refactoring. In Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2022). Association for Computing Machinery, New York, NY, USA, 1544–1554. Moderator: Rohith Pudari

Jan 29, 2024

Fang, Hongbo, James Herbsleb, and Bogdan Vasilescu. "Novelty Begets Popularity, But Curbs Participation - A Macroscopic View of the Python Open-Source Ecosystem." ICSE 2024. Moderator: Jiayi Sun

Jan 22, 2024

Li, Xingjun, Yizhi Zhang, Justin Leung, Chengnian Sun, and Jian Zhao. "EDAssistant: Supporting Exploratory Data Analysis in Computational Notebooks with In Situ Code Search and Recommendation." ACM Transactions on Interactive Intelligent Systems 13, no. 1 (2023): 1-27. Moderator: Sepehr Hosseini Khorasgani

Dec 11, 2023

Felipe Fronchetti, David C. Shepherd, Igor Wiese, Christoph Treude, Marco Aurélio Gerosa, and Igor Steinmacher. 2023. "Do CONTRIBUTING Files Provide Information about OSS Newcomers’ Onboarding Barriers?" In Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2023). Association for Computing Machinery, New York, NY, USA, 16–28. Moderator: Shurui Zhou

Nov 20, 2023

A. M. Eilertsen and G. C. Murphy, "The Usability (or Not) of Refactoring Tools," 2021 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER), Honolulu, HI, USA, 2021, pp. 237-248, doi: 10.1109/SANER50967.2021.00030. Moderator: Rohith Pudari

Nov 13, 2023

Lora Oehlberg, Wesley Willett, and Wendy E. Mackay. 2015. Patterns of Physical Design Remixing in Online Maker Communities. In Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems (CHI '15). Association for Computing Machinery, New York, NY, USA, 639–648. Moderator: Kathy Cheng

Oct 30, 2023

Yuxia Zhang, Klaas-Jan Stol, Hui Liu, and Minghui Zhou. 2022. "Corporate dominance in open source ecosystems: a case study of OpenStack". In Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2022). Association for Computing Machinery, New York, NY, USA, 1048–1060.  Moderator: Jiayi Sun

Oct 23, 2023

Ángel Alexander Cabrera, Marco Tulio Ribeiro, Bongshin Lee, Robert Deline, Adam Perer, and Steven M. Drucker. 2023. "What Did My AI Learn? How Data Scientists Make Sense of Model Behavior." ACM Trans. Comput.-Hum. Interact. 30, 1, Article 1 (February 2023), 27 pages. Moderator: Sepehr Hosseini Khorasgani

Oct 2, 2023

Alexopoulos, Nikolaos, Manuel Brack, Jan Philipp Wagner, Tim Grube, and Max Mühlhäuser. "How Long Do Vulnerabilities Live in the Code? A Large-Scale Empirical Measurement Study on FOSS Vulnerability Lifetimes." In 31st USENIX Security Symposium (USENIX Security 22), pp. 359-376. 2022. Moderator: Arjun Sridharkumar

Sep 25, 2023

Wang et al. Can Machine Learning Pipelines Be Better Configured? The ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE) 2023. [accepted paper list for FSE2023] Moderator: Shurui Zhou

August 21, 2023

Jinglin Peng, Weiyuan Wu, Brandon Lockhart, Song Bian, Jing Nathan Yan, Linghao Xu, Zhixuan Chi, Jeffrey M. Rzeszotarski, and Jiannan Wang. 2021. DataPrep.EDA: Task-Centric Exploratory Data Analysis for Statistical Modeling in Python. In Proceedings of the 2021 International Conference on Management of Data (SIGMOD '21). Association for Computing Machinery, New York, NY, USA, 2271–2280. Moderator: Yuhang Yan

August 14, 2023

Miryung Kim, Thomas Zimmermann, and Nachiappan Nagappan. 2012. A field study of refactoring challenges and benefits. In Proceedings of the ACM SIGSOFT 20th International Symposium on the Foundations of Software Engineering (FSE '12). Association for Computing Machinery, New York, NY, USA, Article 50, 1–11. Moderator: Rohith Pudari

June 26, 2023

Srishti Palani, David Ledo, George Fitzmaurice, and Fraser Anderson. 2022. ”I don’t want to feel like I’m working in a 1960s factory”: The Practitioner Perspective on Creativity Support Tool Adoption. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems (CHI '22). Moderator: Kathy Cheng

June 5, 2023

Xin Tan, Kai Gao, Minghui Zhou, and Li Zhang. 2022. “An exploratory study of deep learning supply chain”. In Proceedings of the 44th International Conference on Software Engineering (ICSE '22). Association for Computing Machinery, New York, NY, USA, 86–98. Moderator: Jiayi Sun

April 17, 2023

M. Wei, N. S. Harzevili, Y. Huang, J. Wang and S. Wang, "CLEAR: Contrastive Learning for API Recommendation," 2022 IEEE/ACM 44th International Conference on Software Engineering (ICSE), Pittsburgh, PA, USA, 2022, pp. 376-387. Moderator: Rohith Pudari

April 10, 2023

Lingfeng Bao, Xin Xia, Ahmed E. Hassan, and Xiaohu Yang. 2022. V-SZZ: automatic identification of version ranges affected by CVE vulnerabilities. In Proceedings of the 44th International Conference on Software Engineering (ICSE '22). Moderator: Arjun Sridharkumar

April 3, 2023

Nathaniel Weinman, Steven M. Drucker, Titus Barik, and Robert DeLine. 2021. Fork It: Supporting Stateful Alternatives in Computational Notebooks. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems (CHI '21). Moderator: Enmeng Liu

Mar 27, 2023

Sun, Jiamou, et al. “Task-oriented api usage examples prompting powered by programming task knowledge graph.” 2021 IEEE International Conference on Software Maintenance and Evolution (ICSME). IEEE, 2021. Moderator: Shurui Zhou

Mar 7, 2023

Chen, Xinyun, et al. "Spreadsheetcoder: Formula prediction from semi-structured context." International Conference on Machine Learning. PMLR, 2021. Moderator: Jiayi Sun

Feb 27, 2023

Amershi, Saleema, et al. "Software engineering for machine learning: A case study." 2019 IEEE/ACM 41st International Conference on Software Engineering: Software Engineering in Practice (ICSE-SEIP). IEEE, 2019. Moderator: Rohith Pudari

Feb 6, 2023

Pan, Shengyi, et al. "Automated unearthing of dangerous issue reports." Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 2022. Moderator: Arjun Sridharkumar

Jan 30, 2023

Liu, Michael Xieyang, et al. "Crystalline: Lowering the Cost for Developers to Collect and Organize Information for Decision Making." CHI Conference on Human Factors in Computing Systems. 2022. Moderator: Enmeng Liu

Jan 16, 2023

Zhang, Yuxia, et al. "How do companies collaborate in open source ecosystems? an empirical study of OpenStack." ICSE 2020. Moderator: Jiayi Sun

Jan 9, 2023

Feng, Zixuan, et al. "A case study of implicit mentoring, its prevalence, and impact in Apache." ESEC/FSE 2022. Moderator: Shurui Zhou

Aug 26, 2022

Vaswani et al. (2017). Attention is all you need. Advances in neural information processing systems, 30. Moderator: Robert Ren

Aug 5, 2022

Tang, Yiming, et al. "Automated evolution of feature logging statement levels using Git histories and degree of interest." Science of Computer Programming 214 (2022): 102724. Moderator: Phil Cuvin

July 29, 2022

North, Kevin J., Anita Sarma, and Myra B. Cohen. "Understanding git history: A multi-sense view." Proceedings of the 8th International Workshop on Social Software Engineering. 2016. Moderator: Xinyan He

July 22, 2022

Saha, Ripon K., et al. "SapientML: Synthesizing Machine Learning Pipelines by Learning from Human-Written Solutions."ICSE2022 [presentation] Moderator: Chan Yang

July 8, 2022

Reid, David, Mahmoud Jahanshahi, and Audris Mockus. The Extent of Orphan Vulnerabilities from Code Reuse in Open Source Software. (ICSE2022). Moderator: Arjun Sridharkumar

June 24, 2022

Yan, Jing Nathan, Ziwei Gu, and Jeffrey M. Rzeszotarski. "Tessera: Discretizing data analysis workflows on a task level." Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems. 2021. Moderator: Enmeng Liu

June 3, 2022

Gote, Christoph, et al. "Big Data= Big Insights? Operationalising Brooks' Law in a Massive GitHub Data Set." ICSE (2022). Moderator: Jiayi Sun

May 20, 2022

Li, Renee, et al. Code of Conduct Conversations in Open Source Software Projects on Github. Proceedings of the ACM on Human-Computer Interaction 5.CSCW1 (2021): 1-31. Moderator: Minghao Li

May 13, 2022

Hanses, Sarah, and Jennifer Wang. How Do Users Interact with AI Features in the Workplace? Understanding the AI Feature User Journey in Enterprise. CHI Conference on Human Factors in Computing Systems Extended Abstracts. 2022. Moderator: Shurui Zhou

May 6, 2022

Muller, Michael, et al. How data science workers work with data: Discovery, capture, curation, design, creation. Proceedings of the 2019 CHI conference on human factors in computing systems. 2019. Moderator: Enmeng Liu

April 22, 2022

Nguyen, Giang, et al. Manas: Mining Software Repositories to Assist AutoML. ICSE2022. Moderator: Shurui Zhou

April 8, 2022

Chopra, Ashish, et al. @ alex, this fixes# 9": Analysis of Referencing Patterns in Pull Request Discussions. CSCW(2021): 1-25. Moderator: Jiayi Sun

April 1, 2022

Jimenez, Matthieu, et al. The importance of accounting for real-world labelling when predicting software vulnerabilities. Proceedings of the 2019 27th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE). 2019.Moderator: Arjun Sridharkumar

Mar 25, 2022

Wang, Dakuo, et al.Autods: Towards human-centered automation of data science. Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems. 2021. Moderator: Enmeng Liu

Mar 18, 2022

Yang, Qian, et al. Grounding interactive machine learning tool design in how non-experts actually build models. Proceedings of the 2018 designing interactive systems conference. 2018. Moderator: Shurui Zhou

Mar 11, 2022

Liu, Chengwei, et al. Demystifying the Vulnerability Propagation and Its Evolution via Dependency Trees in the NPM Ecosystem. ICSE 2022.Moderator: Jiayi Sun

Mar 4, 2022

A. Sabetta and M. Bezzi, A practical approach to the automatic classification of security-relevant commits, in Proceedings of the 34th IEEE International Conference on Software Maintenance and Evolution (ICSME). IEEE, 2018 Moderator: Arjun Sridharkumar

Feb 18, 2022

Maltbie, Nicholas, et al. XAI tools in the public sector: A case study on predicting combined sewer overflows. Proceedings of the 29th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 2021. Moderator: Enmeng Liu

Feb 11, 2022

Sung, Chungha, et al. Towards understanding and fixing upstream merge induced conflicts in divergent forks: An industrial case study. Proceedings of the ACM/IEEE 42nd International Conference on Software Engineering: Software Engineering in Practice. 2020 Moderator: Shurui Zhou

Feb 4, 2022

Ma, Wanwangying, et al. How do developers fix cross-project correlated bugs? a case study on the github scientific python ecosystem 2017 IEEE/ACM 39th International Conference on Software Engineering (ICSE). IEEE, 2017. Moderator: Jiayi Sun

Jan 28, 2022

Sabottke, Carl, Octavian Suciu, and Tudor Dumitraș. Vulnerability disclosure in the age of social media: Exploiting twitter for predicting real-world exploits 24th {USENIX} Security Symposium ({USENIX} Security 15). 2015. Moderator: Arjun Sridharkumar

Jan 21, 2022

Wu, Huayao, et al. Identifying Key Features from App User Reviews. 2021 IEEE/ACM 43rd International Conference on Software Engineering (ICSE). IEEE, 2021.Moderator: Enmeng Liu

Jan 14, 2022

Kenneth Holstein, Jennifer Wortman Vaughan, Hal Daumé III, Miro Dudik, and Hanna Wallach. 2019. Improving fairness in machine learning systems: What do industry practitioners need? In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems. ACM, 600.Moderator: Shurui Zhou

Dec 13, 2021

Arvanitou, Elvira-Maria, et al. Software engineering practices for scientific software development: A systematic mapping study. Journal of Systems and Software 172 (2021): 110848. Moderator: Jiayi Sun

Dec 6, 2021

Liao, Q. Vera, Daniel Gruen, and Sarah Miller. Questioning the AI: informing design practices for explainable AI user experiences. Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems. 2020. Moderator: Enmeng Liu

Nov 29, 2021

Zhao, Liping, et al. "Natural Language Processing for Requirements Engineering: A Systematic Mapping Study." ACM Computing Surveys (CSUR) 54.3 (2021): 1-41. Moderator: Shurui Zhou

Nov 1, 2021

Zhou, J., Pacheco, M., Wan, Z., Xia, X., Lo, D., Wang, Y., & Hassan, A. E. Finding A Needle in a Haystack: Automated Mining of Silent Vulnerability Fixes. ASE'2021. Moderator: Jiayi Sun

Oct 25, 2021

Pornprasit, C., Tantithamthavorn, C., Jiarpakdee, J., Fu, M., & Thongtanunam, P. PyExplainer: Explaining the Predictions of Just-In-Time Defect Models. ASE2021 Moderator: Enmeng Liu

Oct 18, 2021

Wang et al. Human-AI Collaboration in Data Science: Exploring Data Scientists' Perceptions of Automated AI. Proceedings of the ACM on Human-Computer Interaction 3, no. CSCW (2019): 1-24. Moderator: Shurui Zhou

Aug 9, 2021

Zhang, Jie M., and Mark Harman. "Ignorance and Prejudice” in Software Fairness. 2021 IEEE/ACM 43rd International Conference on Software Engineering (ICSE). IEEE, 2021.Moderator: Shurui Zhou

July 26, 2021

Rath, Michael, et al. Traceability in the wild: automatically augmenting incomplete trace links. Proceedings of the 40th International Conference on Software Engineering. 2018. Moderator: Zihan Chen

July 19, 2021

Biazzini, Marco, and Benoit Baudry. May the fork be with you: novel metrics to analyze collaboration on GitHub. Proceedings of the 5th international workshop on emerging trends in software metrics. 2014.Moderator: Linhao Liu

July 12, 2021

Lisha Li, Zhilei Ren, Xiaochen Li, Weiqin Zou, and He Jiang. 2018. How are Issue Units Linked? Empirical Study on the Linking Behavior in GitHub. In Proceedings of the 25th Asia-Pacific Software Engineering Conference. Moderator: Nilofer Hyder

July 5, 2021

Saini, Vaibhav, et al. Oreo: Detection of clones in the twilight zone. Proceedings of the 2018 26th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 2018. Moderator: Hamza Dugmag

June 28, 2021

Chattopadhyay, Souti, et al. What's Wrong with Computational Notebooks? Pain Points, Needs, and Design Opportunities. Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems. 2020. Moderator: Jimmy Yang

June 21, 2021

Kim, Youngtaek, et al. Githru: Visual Analytics for Understanding Software Development History Through Git Metadata Analysis. arXiv preprint arXiv:2009.03115 (2020).Moderator: Vicky Xu

June 14, 2021

Li, Jian, et al. Software defect prediction via convolutional neural network. 2017 IEEE International Conference on Software Quality, Reliability and Security (QRS). IEEE, 2017.Moderator: Willis Guo

June 7, 2021

Hepworth, Ammon I., et al. Automated conflict avoidance in multi-user CAD. Computer-Aided Design and Applications 11.2 (2014): 141-152. Moderator: Jasmine Zhang

May 31, 2021

Heaton, Dustin, and Jeffrey C. Carver. Claims about the use of software engineering practices in science: A systematic literature review. Information and Software Technology 67 (2015): 207-219. Moderator: Sophie Kim

May 17, 2021

García, S., Strüber, D., Brugali, D., Berger, T. and Pelliccione, P., 2020, November. Robotics software engineering: A perspective from the service robotics domain. In Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering (pp. 593-604). (ESEF/FSE 2020). Moderator: Jiayi Sun

May 10, 2021

Weinman, Nathaniel, Titus Barik, Steven M. Drucker, and Rob DeLine. Fork It: Supporting Stateful Alternatives in Computational Notebooks. (CHI2021). Moderator: Shurui Zhou