Abstract
The Web contains so much information that it is almost beyond measure. How do users manage the useful information that they have seen while screening out the rest that doesn't interest them? Bookmarks help, but bookmarking a page doesn't guarantee that it will be available forever. Search engines are becoming more powerful, but they can't be customized based on the access history of individual users. This paper suggests that a better alternative to managing web information is through a middleware approach based on iPROXY, a programmable proxy server. iPROXY offers a suite of archiving, retrieval, and searching services. It can extend a URL to include commands that archive and retrieve pages. Its modular architecture allows users to plug in new features without having to change existing browsers or servers. Once installed on a network, iPROXY can be accessed by users using different browsers and devices. Internet service providers who offer customers iPROXY will be free to develop new services without having to wait for the dominant browsers to be updated.
- {Archive 00} The Internet Archive: Building an Internet Library, http://www.archive.org, March 2000.Google Scholar
- {Barrett & Maglio 99} Rob Barrett, Paul P. Maglio, Intermediaries: New Places for Producing and Manipulating Web Content, Proceedings of the Seventh International World Wide Web Conference, Brisbane, Australia, 1998. Google ScholarDigital Library
- {Chen and Koutsofios 97} Yih-Farn Chen and Eleftherios Koutsofios, WebCiao: A Website Visualization and Tracking System, Proceedings of WebNet97, Toronto, Canada, October, 1997.Google Scholar
- {Cheswick and Bellovin 94} W. R. Cheswick and S. M. Bellovin, Firewalls and Internet Security: Repelling the Wily Hacker, Addison-Wesley, 1994. Google ScholarDigital Library
- {Davison 99} B. D. Davison, A Survey of Proxy Cache Evaluation Techniques, Proceedings of Fourth International Web Caching Workshop (WCW99), San Diego, March 1999.Google Scholar
- {Douglis et al. 97} Fred Douglis, Anja Feldmann, Balachander Krishnamurthy, and Jeffrey Mogul, Rate of Change and other Metrics: a Live Study of the World Wide Web, USENIX Symposium on Internet Technologies and Systems, December 1997, pp. 147-158. Google ScholarDigital Library
- {Douglis et al. 98} Fred Douglis, Thomas Ball, Yih-Farn Chen, and Eleftherios Koutsofios The AT&T Internet Difference Engine: Tracking and Viewing Changes on the Web, World Wide Web Journal, Vol. 1, No. 1, Baltzer Science Publishers, January, 1998, pp. 27-44. Google ScholarDigital Library
- {JigSaw 99} JigSaw - The W3C's Web Server, World Wide Web Consortium, http://www.w3.org/Jigsaw/Google Scholar
- {Lucent 99} Lucent Technologies, The Lucent Personalized Web Assistant, http://www.bell-labs.com/project/lpwa/system.html, 1999Google Scholar
- {PICS 99} Platform for Internet Content Selection, World Wide Web Consortium, http://www.w3.org/PICSGoogle Scholar
- {Luotonen 98} Ari Luotonen, Web Proxy Servers, Prentice Hall, 1998, pp. 213-225. Google ScholarDigital Library
- {Li et al. 99} Wen-Syan Li, Quoc Vu, Divakant Agrawal, Yoshinori Hara, and Hajime Takano, PowerBookmarks: a system for personalizable Web information organization, sharing, and management, Proceedings of the Eighth International World Wide Web Conference, Toronto, Canada, May 1999, pp. 297-311. Google ScholarDigital Library
- {Netscape 99} My Netscape, http://my.netscape.com, NetScape Communications Corp., January, 1999.Google Scholar
- {Rao et al. 99a} Herman Rao, Yih-Farn Chen, Ming-Feng Chen, and Josie Cheng, iProxy: A Programmable Proxy Server, Poster Proceedings of the WebNet99 Conference, Oct. 1999. Also, visit. http://www.research.att.cow/sw/tools/iproxy.Google Scholar
- {Rao et al. 99b} Herman Rao, Yih-Farn Chen, Ming-Feng Chen, and Josie Cheng, A Proxy-Based Personal Portal, Proceedings of the WebNet99 Conference, Hawaii, Oct. 1999.Google Scholar
- {URL 99} Uniform Resource Locators, World Wide Web Consortium, http://www.w3.org/Addressing/URL/Google Scholar
- {Yahoo 99} My Yahoo, http://my.yahoo.com, Yahoo Inc., January, 1999.Google Scholar
Index Terms
- A proxy-based personal web archiving service
Recommendations
Data quality in web archiving
WICOW '09: Proceedings of the 3rd workshop on Information credibility on the webWeb archives preserve the history of Web sites and have high long-term value for media and business analysts. Such archives are maintained by periodically re-crawling entire Web sites of interest. From an archivist's point of view, the ideal case to ...
Intelligent crawling of web applications for web archiving
WWW '12 Companion: Proceedings of the 21st International Conference on World Wide WebThe steady growth of the World Wide Web raises challenges regarding the preservation of meaningful Web data. Tools used currently by Web archivists blindly crawl and store Web pages found while crawling, disregarding the kind of Web site currently ...
Web 2.0 proxy: upgrading websites from web 1.0 to web 2.0
Since the term "Web 2.0" appears, a new generation of Web is coming. There are many articles referring to how to design a Web 2.0 website. However, the traditional Web 1.0 websites are still multitudinous currently. The developers of the traditional Web ...
Comments