Project Reports
- Final Project Report (2023)
- Midterm Project Report (2020)
PhD Dissertations
-
Lars Kroll - Compile-time Safety and Runtime Performance in Programming Frameworks for Distributed Systems (2020)
-
Theodore Vasiloudis - Scalable Machine Learning through Approximation and Distributed Computing (2019)
-
Paris Carbone - Scalable and Reliable Data Stream Processing (2018)
-
Salman Niazi - Scaling Distributed Hierarchical File Systems Using NewSQL Databases (2018)
-
Jingna Zeng - Augmenting Transactional Memory with the Future Abstraction (2020)
Posters
-
Paris Carbone, Lars Kroll, Klas Segeljakt, Max Meldrum, Adam Hasselberg, Christian Schulte, Seif Haridi - Continuous Deep Analytics (CDA)
-
Klas Segeljakt, Frej Drejhammar - Using MLIR to implement a compiler for Arc, a language for Batch and Stream Programming
MSc Theses
-
Ashmitha Ambastha - The State of Stateful Serverless Systems
-
Luca Staboli - Trusted Execution Environment deployment through cloud Virtualization: Aproject on scalable deployment of virtual machines
-
Christopher Gustafson - Improving Availability of Stateful Serverless Functions in Apache Flink
-
Ao Song - Tiered Storage Architecture for Stream Processing Systems
-
Mikolaj Robakowski - Comparison of State Backends for Modern Stream Processing System
-
Massimo Perini - Dynamic Graph Embedding on Event Streams with Apache Flink
-
Max Meldrum - Hardware Utilisation Techniques for Data Stream Processing
-
Klas Segeljakt - A Scala DSL for Rust code generation
-
Oscar Bjuhr - Relocatable Driver and Code Generator for Continuous Deep Analytics
-
Johan Mickos - Design of a Network Library for Continuous Deep Analytics
-
Tobias Lindener - Enabling Arbitrary Memory Constraint Standing Queries on Distributed Stream Processors using Approximate Algorithms
Conferences, Journals and Workshops
-
H Ng, S Haridi, P Carbone. Omni-Paxos: Breaking the Barriers of Partial Connectivity - EuroSys’23
-
H Ng, K Wu, P Carbone. UniCache: Efficient Log Replication through Learning Workload Patterns - EDBT2023
-
J Spenger, C Huang, P Haller, P Carbone - Portals: A Showcase of Multi-Dataflow Stateful Serverless - PVLDB 2023.
-
Vasiloudis, Theodore, Foteini Beligianni, and Gianmarco De Francisci Morales - BoostVHT: Boost- ing Distributed Streaming Decision Trees Proceedings of the 2017 ACM Conference on Information and Knowledge Management ACM, (2017)
-
J Zeng, P Romano, J Barreto, L Rodrigues, S Haridi - Online Tuning of Parallelism Degree in Parallel Nesting Transactional Memory IEEE International Parallel and Distributed Processing Symposium (IPDPS 2018)
-
M Ismail, M Ronström, S Haridi, J Dowling - ePipe: Near Real-Time Polyglot Persistence of HopsFS Metadata 19th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing 2019
-
Kefato, Girdzijauskas et al. Context-sensitive node representations - Gossip and Attend: Context-Sensitive Graph Representation Learning ICWSM 2020
-
Antaris, Girdzijauskas et al. EGAD: Evolving Graph Representation Learning with Self-Attention and Knowledge Distillation for Live Video Streaming Events IEEE Big Data 2020
-
Lee, Girdzijauskas et al. Z-Embedding: A spectral representation of event intervals for efficient clustering and classification ECML/PKDD 2020
-
S Horchidan, P Carbone - ORB: Empowering Graph Queries through Inference - DMKG @ ESWC 2023.
-
J Spenger, P Carbone, P Haller. Portals: An Extension of Dataflow Streaming for Stateful Serverless - Onward! 2022, SPLASH 2022
-
S Horchidan, E Kritharakis, B Kalavri, P Carbone - Evaluating model serving strategies over streaming data - DEEM @ SIGMOD 2022 (Best Paper Award)
-
M Zwolak, Z Abbas, S Horchidan, P Carbone, V Kalavri - GCNSplit: Bounding the State of Streaming Graph Partitioning - aiDM @ SIGMOD 2022
-
J Spenger, P Carbone, P Haller. Wip: Pods: Privacy compliant scalable decentralized data services - Poly @ VLDB 2021
-
P Carbone, M Fragkoulis, V Kalavri, A Katsifodimos - Beyond Analytics: the Evolution of Stream Processing Systems - ACM SIGMOD 2020 (draft)
-
H Boström, U Johansson – Mondrian Conformal Regressors - COPA 2020
-
L Karlsson, H Boström, P Zieger – Classification of Aerosol Particles using Inductive Conformal Prediction - COPA (2020)
-
H Werner, L Carlsson, E Ahlberg, H Boström – Evaluating Different Approaches to Calibrating Conformal Predictive Systems - COPA 2020
-
H Bostroöm, P Höglund, S-O Junker, A-S Oberg, M Sparr – Explaining Multivariate Time Series Forecasts: an Application to Predicting the Swedish GDP, XI-ML (2020)
-
T Vasiloudis, H Cho, H Boström – Block-distributed Gradient Boosted Trees, Workshop on Systems for ML at NeurIPS (2019)
-
Max Meldrum, Klas Segeljakt, Lars Kroll, Paris Carbone, Christian Schulte, Seif Haridi - Arcon: Continuous and Deep Data Stream Analytics - BIRTE @ VLDB 2019 (preprint)
-
Lars Kroll, Klas Segeljakt, Paris Carbone, Christian Schulte, Seif Haridi - Arc: An IR for Batch and Stream Programming - DBPL @ PLDI 2019 (preprint)
-
Theodore Vasiloudis, Gianmarco De Francisci Morales, and Henrik Boström. - Quantifying Uncertainty in Online Regression Forests. - Journal of Machine Learning Research 2019 (preprint)
-
Theodore Vasiloudis, Hyunsu Cho, Henrik Boström - Block-distributed Gradient Boosted Trees - ACM SIGIR 2019 (Best Short Paper Award) (preprint)
-
Salman Niazi, Mikael Ronström. Seif Haridi, Jim Dowling - Size Matters: Improving the Performance of Small Files in Hadoop - ACM Middleware 2018
-
Z Abbas, V Kalavri, P Carbone, V Vlassov - Streaming graph partitioning: an experimental study - VLDB 2018
-
Vasiliki Kalavri, Vladimir Vlassov, Seif Haridi - High-level programming abstractions for distributed graph processing-IEEE Transactions in Knowledge Data Engineering (TKDE Journal) 2018
-
Sherif Sakr, Tilmann Rabl, Martin Hirzel , Paris Carbone, Martin Strohbach - Dagstuh Seminar on Big Stream Processing SIGMOD Record 2018
-
Lars Kroll, Paris Carbone, Seif Haridi - Kompics Scala: narrowing the gap between algorithmic specification and executable code (ACM SIGPLAN Symposium on Scala 2017)
-
Carbone, P., Ewen, S., Fóra, G., Haridi, S., Richter, S., & Tzoumas, K. - State management in Apache Flink®: consistent stateful distributed stream processing. Proceedings of the VLDB Endowment (2017)
-
Muhammad Anis Uddin Nasir, Aristides Gionis, Gianmarco De Francisci Morales, Sarunas Girdzijauskas – Fully Dynamic Algorithm for Top-k Densest Subgraphs - CIKM 2017
-
Kambiz Ghoorchian, Sarunas Girdzijauskas, Fatemeh Rahimian – DeGPar: Large Scale Topic - Detection Using Node-Cut Partitioning on Dense Weighted Graphs. - ICDCS 2017
-
Lars Kroll, Jim Dowling, Seif Haridi - Static Type Checking for the Kompics Component Model: Kola – The Kompics Language - PMLDC @ ECOOP 2016
-
Carbone, P., Katsifodimos, A., Ewen, S., Markl, V., Haridi, S., & Tzoumas, K. . - Apache flink: Stream and batch processing in a single engine. Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, (2015)
Technical Reports and Surveys
-
Paris Carbone, Asterios Katsifodimos, Seif Haridi - Stream Window Aggregation Semantics and Optimization Encyclopedia of Big Data Technologies (2019)
-
S Haridi, L Kroll, P Carbone - Lecture Notes on Leader-based Sequence Paxos – An Understandable Sequence Consensus Algorithm arXiv preprint (2020)
-
M Fragkoulis, P Carbone, V Kalavri, A Katsifodimos – A Survey on the Evolution of Stream Processing Systems, arXiv preprint (2020)
Talks and Events
-
Jonas Spenger - Crossing the Boundaries of Stateful Streaming and Actors using Serverless Portals - Scala Days Madrid 2023
-
Max Meldrum - Uncovering Arcon: A state-first Rust streaming analytics runtime - FOSDEM 22
-
Sruthi Kumar & Hasseb Asif – FlinkNDB: Skyrocketing Stateful Capabilities of Apache Flink, Flink Forward Global Event (Oct. 2020)
-
Sruthi Kumar & Hasseb Asif – NEXMark-Beam: Your best companion for testing and benchmarking new core stream processing libraries, Beam Summit (Aug. 2020)
-
Frej Drejhammar - Euro-LLVM Extending Clang and LLVM for Interpreter Profiling Perf-ection (Apr. 2020)
-
Klas Segeljakt - Euro-LLVM Experiences using LLVM to implement a custom language (Apr. 2020)
-
Klas Segeljakt - PLDS Arc: An MLIR dialect for Data Analytics (Mar. 2020)
-
Paris Carbone - PLDS Seamless Batch and Stream Computation on Heterogeneous Hardware with Arcon (Mar. 2020)
-
Paris Carbone - Chaos Engineering Workshop Reliable Stream Processing at Scale with Apache Flink (Dec. 2019)
-
Paris Carbone - NDSML Continuous Intelligence - Intersecting Event-Based Business Logic and ML (Oct. 2019)
-
Paris Carbone - CASTOR Software Days Continuous Intelligence through Computation Sharing (Oct. 2019)
-
Massimo Perini - Flink Forward - Deep Stream Dynamic Graph Analytics with Grapharis
-
Klas Segeljakt, Max Meldrum - Flink Forward - Introducing Arc: An IR for unified batch and streaming (Oct. 2019)
-
Paris Carbone - Stream Loops on Flink - Flink Forward Berlin (Nov. 2018)
-
Paris Carbone - Asynchronous Epoch Commits for Fast and Consistent Stateful Streaming with Apache Flink . LADIS/PODC (July 2018).
-
Paris Carbone - The Road to Continuous Deep Analytics - RISE Open House (Apr. 2018)