About SEA4DQ

Modern software systems are centered on data, using data on an increasing scale and in novel and intelligent ways. Key drivers for increased data availability include the Internet of Things (IoT), data sharing platforms, as well as open data portals. Data quality is crucial, as the data acquired and used by modern software systems strongly impacts on the reliability, robustness, efficiency, and trustworthiness of these systems.

How can software engineering and artificial intelligence (AI) help manage and tame data quality issues?

This is the question we aim to investigate in the workshop SEA4DQ. The SEA4DQ 2023 workshop is the third workshop of the series and discusses novel software engineering and AI techniques that address data quality issues of modern systems. The workshop provides a forum for researchers and practitioners to exchange ideas, lessons learnt, and visions for the future.

Topics of Interest

  • Requirements engineering for data quality
  • Software architectures and frameworks for data quality management
  • Software and AI solutions for data ingestion, collection, and acquisition for the sake of data quality
  • Software and AI solutions for Data pre-processing, filtering, labelling, cleaning for the sake of data quality
  • Software tools for data quality management, testing, and profiling
  • Quantification of data quality
  • Case studies evaluating existing data quality technique on real systems
  • Trade-off between data quality, performance, and accuracy

Invited Speakers

...
Lei Ma
Associate Professor
The University of Tokyo, Japan
University of Alberta, Canada
...
Ashish Tiwari
Principal Researcher, Microsoft
...
Rajdeep Mukherjee
Applied Scientist, Amazon

Keynote

...

Associate Professor - The University of Tokyo, Japan
University of Alberta, Canada

"Data Quality Assurance in the Era of Data-Driven Intelligence: Challenges and Opportunities"

Data quality assurance has been long recognized as a notoriously challenging task, which mainly relies on manual design and task-specific measurement criteria across diverse domains. Until recently, manual inspection and modification still play a dominant role throughout the lifecycle of data quality management in many commercial companies, which however, could not satisfy the increasing quality demands of requirements as a data-driven software system in this speed era. In this talk, I would give a high-level introduction to the state-of-the-art activities in advancing data quality assurance by AI-driven solutions and frameworks. I will also highlight the challenges, opportunities, and possible future directions.

Lei Ma is currently an Associate Professor with the honorary title of Excellent Researcher at The University of Tokyo; as well as an associate professor and Canada CIFAR AI Chair with University of Alberta. He is a Fellow with Amii — Alberta Machine Intelligence Institute and Director with the Center of Excellence on Trustworthiness Assurance and Engineering of Intelligent Vehicle at Autoware Foundation. His research centers around the interdisciplinary fields of human-centered trustworthy software engineering (SE), artificial intelligence (AI), and cyber-physical system (CPS) with a special focus on the quality, reliability, safety and security assurance, as well as the interpretation and human interactivity of machine learning and AI Systems. Many of his works were published in top-tier software engineering and AI venues (e.g., TSE, TOSEM, ICSE, FSE, ASE, CAV, ICML, NeurIPS, ACM MM, AAAI, IJCAI, TDSC). He has received more than 10 prestigious academic awards, including three ACM SIGSOFT Distinguished Paper Awards.

Schedule

December 4, 2023 - All times are in San Francisco local time (GMT-7)

Start - End Topic Presenters
09:00 - 09:15 Welcome, Objectives and Agenda Organization Committee
09:15 - 10:15 Keynote: Data Quality Assurance in the Era of Data-Driven Intelligence: Challenges and Opportunities Lei Ma
10:15 - 10:30 Break
10:30 - 11:00 Industry Talk: Data Quality at Microsoft Ashish Tiwari
11:00 - 11:30 Paper Presentation: Enhancing Data Quality in Large-Scale Software Systems for Industrial Automation Valentina Golendukhina
11:30 - 12:00 Paper Presentation: Data Pre-Processing and Sensor-Fusion for Multivariate Statistical Process Control of an Extrusion Process Frank Westad
12:00-13:30 Lunch Break
13:30 - 14:00 Industry talk: Data Quality in AWS Codewhisperer Rajdeep Mukherjee
14:00 - 14:30 Paper Presentation: A Nearest Neighbor-Based Concept Drift Detection Strategy for Reliable Tool Condition Monitoring Nicolas Jourdan
14:30 - 15:00 Paper Presentation: Challenges for Predictive Quality in Multi-Stage Manufacturing: Insights from Literature Review Beatriz Cassoli
15:00 - 15:15 Break
15:15 - 15:45 Invited Talk: Text Quality and Text Diversity Metrics for NLP Applications Fabrice Harel-Canada
15:45 - 16:30 Panel Discussion Organization Committee
16:30 Closing Organization Committee

Registration can be done at the FSE conference website.

Organization Committee

...
Foutse Khomh (Main Contact)
General Chair
Polytechnique Montreal, Canada
foutse.khomh@polymtl.ca
...
Preetha Chatterjee
Program Co-Chair
Drexel University, USA
...
Andreas Metzger
Program Co-Chair
University of Duisburg-Essen, Germany
...
Beatriz Cassoli
Industrial Co-Chair
TU Darmstadt, Germany

...
Sagar Sen
Industrial Co-Chair
SINTEF, Norway
...
Nicolas Jourdan
Web Chair
TU Darmstadt, Germany
...
Phu Nguyen
Publicity Co-Chair
SINTEF, Norway
...
Valentina Golendukhina
Publicity Co-Chair
University of Innsbruck, Austria


Program Committee*

  • Christoph Elsner, Siemens AG, Germany
  • Dietmar Winkler, TU Vienna, Austria
  • Enrique Garcia Ceja, U Monterrey, Mexico
  • Ezequiel Scott, University of Tartu, Estonia
  • Felix Mannhardt, Eindhoven University of Technology/KIT-AR, Netherlands
  • Freddy Munoz, Flatiron Health, USA
  • Heng Li, École Polytechnique de Montréal, Canada
  • Helena Holmström Olsson, Malmö University, Sweden
  • Hong-Linh Truong, Aalto University, Finland
  • Jan Nygård, Cancer Registry of Norway, Norway
  • Joanna Santos, Notre Dame University, USA
  • Katinka Wolter, Free University of Berlin, Germany
  • Lei Ma, University of Tokyo, Japan
  • Mehdi Mirakhorli, Rochester, USA
  • Meredith Lee, Berkeley, USA
  • Michael Felderer, University of Innsbruck, Austria
  • Oscar Dieste, Universidad Politecnica de Madrid, Spain
  • Rabe Abdalkareem, Concordia University, Canada
  • Sami Hyrynsalmi, LUT University, Finland
  • Sudipto Ghosh, Colorado State University, USA
* PC members list is in alphabetical order.

Download CfP

Download Call for Papers

The SEA4DQ 2023 Workshop is sponsored by the following research projects: