1

IrisBench: An Open-Source Benchmark Suite for Video Processing Systems in Cloud

Recent advances in generative text-to-video AI models (e.g., VideoPoet and Sora) have spurred a surge in video production, leading to an increased demand for video processing pipelines among various video service providers such as YouTube and TikTok. …

Grunt Attack: Exploiting Execution Dependencies in Microservices

Loosely-coupled and lightweight microservices running in containers are likely to form complex execution dependencies inside the system. The execution dependency arises when two execution paths partially share component microservices, resulting in …

Sync-Millibottleneck Attack on Microservices Cloud Architecture

The modern web services landscape is characterized by numerous fine-grained, loosely coupled microservices with increasingly stringent low-latency requirements. However, this architecture also brings new performance vulnerabilities. In this paper, we …

μConAdapter: Reinforcement Learning-based Fast Concurrency Adaptation for Microservices in Cloud

Modern web-facing applications such as e-commerce comprise tens or hundreds of distributed and loosely coupled microservices that promise to facilitate high scalability. While hardware resource scaling approaches [28] have been proposed to address …

A BlackBox Approach to Profile Runtime Execution Dependencies in Microservices

Loosely-coupled and lightweight microservices running in containers are likely to form complex execution dependencies inside the system. The execution dependency arises when two execution paths partially share component microservices, resulting in …

Sora: A Latency Sensitive Approach for Microservice Soft Resource Adaptation

Fast response time for modern web services that include numerous distributed and lightweight microservices becomes increasingly important due to its business impact. While hardware-only resource scaling approaches (e.g., FIRM [47] and PARSLO [40]) …

ShadowSync: Latency Long Tail caused by Hidden Synchronization in Real-time LSM-tree based Stream Processing Systems

Mission-critical, real-time, continuous stream processing applications that interact with the real world have stringent latency requirements. For example, e-commerce websites like Amazon improve their marketing strategy by performing real-time …

A Functional Model and Analysis of Next Generation Malware Attacks and Defenses

Recent ransomware attacks (e.g., Colonial and JBS) caused significant social and economic impact due to their ability to shut down entire businesses. A functional model divides nextgeneration malware (NG-malware) attacks into 3 stages: Penetration …

DoubleFaceAD:A New Datastore Drive Architecture to Optimize Fanout Query Performance

The broad adoption of fanout queries on distributed datastores has made asynchronous event-driven datastore drivers a natural choice due to reduced multithreading overhead. However, through extensive experiments using the latest datastore drivers …

Mitigating Large Response Time Fluctuations through Fast Concurrency Adapting in Clouds

Dynamically reallocating computing resources to handle bursty workloads is a common practice for web applications (e.g., e-commerce) in clouds. However, our empirical analysis on a standard n-tier benchmark application (RUBBoS) shows that simply …