Web Proxy Acceleration

Numerous studies show that miss ratios at forward proxy are typically at least 40%-50%.
This paper proposes and evaluates a new approach for improving the throughput of Web
proxy systems by reducing the overhead of handling cache misses. Namely, we propose
to front-end a Web proxy with a high performance node that filters the requests, processing
the misses and forwarding the hits and the new cacheable content to the proxy. Requests
are filtered based on hints of the proxy cache content. This system, called Proxy Accelerator,
achieves significantly better communications performance than a traditional proxy system.
For instance, an accelerator can be built as an embedded system optimized for communication
and HTTP processing, or as a kernel-mode HTTP server. Scalability with the Web proxy
cluster size is achieved by using several accelerators. We use analytical models, trace-based
simulations, and a real implementation to study the benefits and the implementation tradeoffs
of this new approach. Our results show that a single proxy accelerator node in front of a four-node
Web proxy can improve the cost-performance ratio by about 40\%. Hint-based request filter
implementation choices that do not affect the overall hit ratio are available. An implementation of
the hint management module integrated in Web proxy software is presented. Experimental
evaluation of the implementation demonstrates that the associated overheads are very small.

By: Daniela Rosu, Arun Iyengar, Daniel Dias

Published in: RC21945 in 2001

LIMITED DISTRIBUTION NOTICE:

This Research Report is available. This report has been submitted for publication outside of IBM and will probably be copyrighted if accepted for publication. It has been issued as a Research Report for early dissemination of its contents. In view of the transfer of copyright to the outside publisher, its distribution outside of IBM prior to publication should be limited to peer communications and specific requests. After outside publication, requests should be filled only by reprints or legally obtained copies of the article (e.g., payment of royalties). I have read and understand this notice and am a member of the scientific community outside or inside of IBM seeking a single copy only.

rc21945.pdf

Questions about this service can be mailed to reports@us.ibm.com .