Gabriel Campero Durand
M.Sc. Gabriel Campero Durand
AG Datenbanken & Software Engineering
Gabriel Campero Durand is a PhD student and associated researcher at the Databases and Software Engineering group of the Otto-von-Guericke University of Magdeburg.
He received his M.Sc. degree in Data and Knowledge Engineering at the University of Magdeburg in 2017. Before his current role he worked with IBM Research, and IBM Cloud Availability Monitoring in Boblingen.
His research focuses on production-ready applications of AI techniques to data management, with a focus on deep reinforcement learning.
Abgeschlossene Projekte
COOPeR: Cross-device OLTP/OLAP PRocessing
Laufzeit: 01.09.2016 bis 30.06.2021
Heutzutage stehen Datenbanksysteme vor zwei Herausforderungen. Auf der einen Seite müssen Datenbanksysteme Online-Transaction-Processing (OLTP) und Online-Analytical-Processing (OLAP) kombinieren, um Echtzeitanalysen von Geschäftsprozessen zu ermöglichen. Die Echtzeitanalysen von Geschäftsprozessen ist notwendig, um die Qualität der erstellten Berichte und Analysen zu verbessern, weil aktuelle Daten für die Analyse verwendet werden an Stelle von historischen Daten,die in traditionellen OLAP-Systemen verarbeitet werden. Auf der anderen Seite, werden Computersysteme zunehmend heterogener, um bessere Hardware-Leistung bereitzustellen. Die Architektur wechselt hierbei von Computersystemen mit Single-Core- CPUs zu Multi-Core-CPUs unterstützt von Ko-Prozessoren. Datenbanksysteme müssen beide Trends berücksichtigen, um die Qualität der Systeme zu verbessern, um die Leistung zu erhöhen, und um sicherzustellen, dass Datenbanksysteme künftigen Anforderungen (z.B. komplexere Anfragen oder erhöhte Datenvolumen) genügen.Leider konzentrieren sich aktuelle Forschungsansätze, jeweils nur auf eine der beiden Herausforderungen, entweder auf die Kombination von OLTP und OLAP Workloads in traditionellen CPU-basierte Systeme oder auf Ko-Prozessor-Beschleunigung für einen einzigen Workload-Typ. Daher gibt es keinen ganzheitlichen Ansatz der beide Herausforderungen berücksichtigt. In diesem Projekt wollen wir beide Herausforderungen von Datenbanksystemen berücksichtigen, um eine effiziente Verarbeitung von kombinierten OLTP/ OLAP-Workloads in hybriden CPU/Ko-Prozessor-Systemen zu ermöglichen. Dies ist notwendig, um Echtzeit-Business-Intelligence zu realisieren. Die größte Herausforderung ist hierbei die ACID-Eigenschaften für OLTP und kombinierten OLTP/OLAP-Workloads in hybriden Systemen zu gewährleisten, und gleichzeitig eine effiziente Verarbeitung der kombinierten Workloads zu ermöglichen.
2023
- Harish Kumar Harihara Subramanian, Bala
Gurumurthy, Gabriel Campero Durand, David Broneske, and Gunter Saake.
Out-of-the-Box Library Support for DBMSOperations On GPUs.
Distributed and Parallel Databases (DAPD), April
2023.
- Bala Gurumurthy, David Broneske,
Gabriel Campero Durand, Thilo Pionteck, and Gunter Saake.
ADAMANT: A Query Executor with Plug-In Interfaces for
Easy Co-processor Integration.
In IEEE International Conference on Data Engineering (ICDE), April
2023.
- Paul Blockhaus, Gabriel Campero Durand,
David Broneske, and Gunter Saake.
Towards a Future of Fully Self-Optimizing
Query Engines.
In 34. Workshop Grundlagen von Datenbanken,
2023.
2021
- Sepideh Sobhgol, Gabriel Campero
Durand, Lutz Rauchhaupt, and Gunter Saake.
Machine Learning within a Graph Database: A Case Study on Link
Prediction of Academic Data.
ICEIS, April 2021.
Accepted.
- Harish Kumar Harihara Subramanian, Bala
Gurumurthy, Gabriel Campero Durand, David Broneske, and Gunter Saake.
Analysis of GPU-Libraries for
rapid Prototyping Database Operations.
In Proceedings of the International Workshop on Big Data Management on
Emerging Hardware (HardBD), pages 36–41, April
2021.
- Anh Trang Le, Gabriel Campero Durand,
Bala Gurumurthy, David Broneske, Christoph Steup, and Gunter Saake.
Design Considerations Towards AI-Driven Co-Processor
Accelerated Database Management.
In Grundlagen von Datenbanken (GvDB), April 2021.
Accepted.
2020
- Xiao Chen, Nishanth Entoor
Venkatarathnam, Kirity Rapuru, David Broneske, Gabriel Campero Durand, Roman
Zoun, and Gunter Saake.
Analysis and
Comparison of Block-Splitting-Based Load Balancing Strategies for Parallel
Entity Resolution.
In International Conference on Information Integration and Web-based
Applications & Services (iiWAS2020), page 446–455. ACM, November
2020.
- Gabriel Campero Durand, Anshu Daur,
Vinayak Kumar, Shivalika Suman, Altaf Mohammed Aftab, Sajad Karim, Prafulla
Diwesh, Chinmaya Hegde, Disha Setlur, Syed Md Ismail, David Broneske, and
Gunter Saake.
Spread the good
around! Information Propagation in Schema Matching and Entity Resolution for
Heterogeneous Data.
Second Workshop on Data Integration to Knowlege Graphs, DI2KG
2020@VLDB, August 2020.
DI2KG Challenge Winner Paper.
(PDF)
- Marcus Pinnecke, Gabriel Campero, David
Broneske, Roman Zoun, and Gunter Saake.
GridTables: A One-Size-Fits-Most H2TAP Data
Store.
Datenbank-Spektrum, Volume 2020/01/31,
2020.
2019
- Sabine Wehnert, Gabriel Campero Durand,
and Gunter Saake.
ERST:
Leveraging Topic Features for Context-Aware Legal Reference
Linking.
In Michał Araszkiewicz and Víctor Rodríguez-Doncel, editors,
Legal Knowledge and Information Systems, volume 322 of
Frontiers in Artificial Intelligence and Applications, pages 113
–122. IOS Press, December 2019.
- Xiao Chen, Yinlong Xu, David Broneske,
Gabriel Campero Durand, Roman Zoun, and Gunter Saake.
Heterogeneous
Committee-Based Active Learning for Entity Resolution (HeALER).
In European Conference on Advances in Databases and Information Systems
(ADBIS), LNCS, pages 69–85, September 2019.
- Gabriel Campero Durand, Rufat Piriyev,
Marcus Pinnecke, David Broneske, and Gunter Saake.
Automated Vertical Partitioning
with Deep Reinforcement Learning.
European Conference on Advances in Databases and Information
Systems, September 2019.
(PDF)
- Gabriel Campero Durand.
Production-Ready Learning-Augmented Data
Management with Deep Reinforcement Learning.
Small Entry Accepted for ECML PKDD Workshop, Unpublished, September
2019.
- Rutuja Pawar, Sepideh Sobhgol, Gabriel
Campero Durand, Marcus Pinnecke, David Broneske, and Gunter Saake.
Codd's World: Topics and their Evolution in the Database
Community Publication Graph.
In Grundlagen von Datenbanken, volume 2367, pages 1–6, June
2019.
- Xiao Chen, Gabriel Campero Durand,
Roman Zoun, David Broneske, Yang Li, and Gunter Saake.
The Best of Both Worlds: Combining Hand-Tuned and Word-Embedding-Based
Similarity Measures for Entity Resolution.
In Datenbanksysteme für Business, Technologie und Web, pages
215 – 224, March 2019.
- Marcus Pinnecke, Gabriel Campero, Roman
Zoun, David Broneske, and Gunter Saake.
Protobase: It’s About Time for Backend/Database
Co-Design.
In Holger Meyer, Norbert Ritter, Andreas Thor, Daniela Nicklas, Andreas Heuer,
and Meike Klettke, editors, BTW 2019 – Workshopband, volume
P-289 of Lecture Notes in Informatics (LNI), pages 515–518.
Gesellschaft für Informatik, March 2019.
- Marcus Pinnecke, Gabriel Campero, Roman
Zoun, David Broneske, and Gunter Saake.
Protobase: It's About Time for Backend/Database Co-Design.
In Datenbanksysteme für Business, Technologie und Web (BTW),
pages 515–518, 2019.
2018
- Iya Arefyeva, David Broneske, Gabriel
Campero, Marcus Pinnecke, and Gunter Saake.
Memory Management Strategies in CPU/GPU Database Systems:
A Survey.
In BDAS. Springer, September 2018.
- Bala Gurumurthy, David Broneske, Marcus
Pinnecke, Gabriel Campero Durand, and Gunter Saake.
SIMD Vectorized Hashing for Grouped
Aggregation.
In Advances in Databases and Information Systems, pages 113 –
126, September 2018.
- Roman Zoun, Gabriel Campero Durand, Kay
Schallert, Apoorva Patrikar, David Broneske, Wolfram Fenske, Robert Heyer,
Dirk Benndorf, and Gunter Saake.
Protein Identification as a Suitable Application for Fast Data
Architecture.
In International Workshop on Biological Knowledge Discovery and Data
Mining (BIOKDD-DEXA), pages 168 – 178. IEEE, September
2018.
- Roman Zoun, Kay Schallert, Atin Janki,
Rohith Ravindran, Gabriel Campero Durand, Wolfram Fenske, David Broneske,
Robert Heyer, Dirk Benndorf, and Gunter Saake.
Streaming FDR Calculation for Protein Identication.
In Advances in Databases and Information Systems, pages 80 – 87,
September 2018.
- Xiao Chen, Kirity Rapuru,
Gabriel Campero Durand, and Eike Schallehn.
Performance Comparison of Three Spark-Based
Implementations of Parallel Entity Resolution.
In International Workshop on Big Data Management in Cloud Systems
(BDMICS-DEXA), pages 76–87. Springer, September
2018.
- Iya Arefyeva, Gabriel Campero Durand,
Marcus Pinnecke, David Broneske, and Gunter Saake.
Low-Latency Transaction Execution
on Graphics Processors: Dream or Reality?.
Ninth International Workshop on Accelerating Analytics and Data
Management Systems Using Modern Processor and Storage Architectures
(ADMS), August 2018.
- Gabriel Campero Durand, Marcus
Pinnecke, Rufat Piriyev, Mahmoud Mohsen, David Broneske, Gunter Saake, Maya
Sekeran, Fabian Rodriguez, and Laxmi Balami.
GridFormation: Towards Self-Driven
Online Data Partitioning using Reinforcement Learning.
In First International Workshop on Exploiting Artificial Intelligence
Techniques for Data Management (aiDM), June 2018.
(PDF)
- Yusra Shakeel, Jacob Krüger, Ivonne
von Nostitz-Wallwitz, Christian Lausberger, Gabriel Campero Durand, Gunter
Saake, and Thomas Leich.
(Automated) Literature Analysis - Threats and
Experiences.
In International Workshop on Software Engineering for Science,
SE4Science, pages 20–27. ACM, May 2018.
- Gabriel Campero Durand, Jingyi Ma,
Marcus Pinnecke, and Gunter Saake.
Piecing together large puzzles, efficiently: Towards
scalable loading into graph database systems.
In Grundlagen von Datenbanken, May 2018.
- Gabriel Campero Durand, Anusha
Janardhana, Marcus Pinnecke, Yusra Shakeel, Jacob Krüger, Thomas Leich,
and Gunter Saake.
Exploring Large Scholarly Networks with Hermes.
In International Conference on Extending Database Technology,
EDBT, pages 650–653. OpenProceedings, March 2018.
2017
- Gabriel Campero Durand, Marcus
Pinnecke, David Broneske, and Gunter Saake.
Backlogs and
interval timestamps: Building blocks for supporting temporal queries in graph
databases.
In Proceedings of the Workshops of the EDBT/ICDT 2017 Joint Conference
(EDBT/ICDT 2017), Venice, Italy, March 21-24, 2017., volume 1810.
CEUR-WS, 2017.
(PDF)
- Marcus Pinnecke, David Broneske,
Gabriel Campero Durand, and Gunter Saake.
Are Databases Fit for
Hybrid Workloads on GPUs? A Storage Engine’s Perspective..
In Proceedings of the International Workshop on Big Data Management on
Emerging Hardware, San Diego, USA, April 22, 2017, pages 1599–1606,
2017.
(PDF)
- Gabriel Campero.
Best Practices for Developing Graph Database
Applications: A Case Study Using Apache Titan.
Master thesis, University of Magdeburg, Germany, January
2017.
- Learning-augmented (& production-ready) solutions for data management
- Data partitioning
- Join order optimitzation (cardinality estimation and plan-space search improvements)
- ...
- Management of machine learning in production
- Safety
- Interpretability
- Tooling
- Curricula design and sample efficiency
- Data management for ML
- ...
- Network analysis