Kernel Support for Faster Web Proxies

This paper proposes two mechanisms for reducing the communication-related overheads of Web applications. One mechanism is 'user-level connection tracking', which allows an application to coordinate its non-blocking I/O operations with significantly fewer system calls than previously possible. The other mechanism is 'data-stream splicing', which allows a Web proxy application to forward data between server and client streams in the kernel with no restrictions on connection persistency, object cacheability, and request pipelining. These mechanisms remove elements that scale poorly with CPU speed, such as context switches and data copies, from the code path of Web-request handling.
The two mechanisms are implemented as Linux loadable kernel modules. User-level connection tracking is used to implement uselect, a user-level select API. The Squid Web proxy and the Polygraph benchmarking tool are used in the evaluation. With Polymix-4, a realistic forward proxy workload biased towards cache hits and small files, the reductions in CPU overheads relative to the original Squid (with select) are 52-72\% for uselect, up to 12\% for splice, and 58-78\% for the two mechanisms combined. Relative to Squid with /dev/epoll, uselect provides 50\% overhead reduction.

By: Marcel Rosu, Daniela Rosu

Published in: General Track 2003 USENIX Annual Technical Conference. Berkeley, CA, , USENIX Assoc. , p.225-38 in 2003

Please obtain a copy of this paper from your local library. IBM cannot distribute this paper externally.

Questions about this service can be mailed to reports@us.ibm.com .