Date of Award
Doctor of Philosophy (PhD)
Parallel and distributed systems are pervasive, such as web services, clouds, and cyber-physical systems. We often desire high throughput and small latency in the parallel and distributed system. However, since the system is distributed and the input is online, scheduling for high throughput while keeping the latency small is often challenging. In this dissertation, we developed scheduling algorithms, policies, and mechanisms to approach high throughput with small latency in various parallel and distributed applications. First, we developed AMCilk runtime system for running multi-programmed parallel jobs on many-processor machines. When running parallel jobs, the allocation of processors to the parallel jobs often changes over time. Since AMCilk conducts responsive reallocation of processors between parallel jobs, various applications achieve significant throughput boosts and latency reductions compared to existing solutions. Second, we built static schedules for the real-time transmission of messages in shared communication media. Medium errors can cause transmission failures, and they are unpredictable. Since our schedules tolerate medium errors with low overhead, real-time applications can transmit messages at high throughput. Finally, we analyzed various data-placement strategies for distributed key-value stores. In distributed key-value stores, a large amount of data is stored across multiple servers. When a request to access data arrives, it is routed to the appropriate server, queued, and eventually processed. If the queue is full, then requests can be rejected. We analyze various data-placement strategies and compare their capacities of avoiding rejections and achieving high throughput with small queue sizes.
Kunal Agrawal, I-Ting Lee, Christopher Gill, Jeremy Fineman,
Available for download on Thursday, December 14, 2023