Useful Web Sites

Journals, Magazines, and Workshops Meetings/Organizations: CHI, AAAI, IJCAI, SIGIR, KDD, WWW, and the Personalization Consortium

Other Hot-Points: www.personalization.com, www.kdnuggets.com

People: John Doyle, Jon Kleinberg, Alon Halevy, Dan Weld

Research Projects

Master List of References

  1. S. Abiteboul, P. Buneman, and D. Suciu, Data on the Web: From Relations to Semistructured Data and XML, Morgan Kaufmann Publishers, 2000.

  2. L. Adamic, The Small World Wide Web, URL: http://www.parc.xerox.com/istl/groups/iea/www/smallworldpaper.html.

  3. C.C. Aggarwal, J. Wolf, K. Wu and P. Yu, Horting Hatches an Egg: A Graph-Theoretic Approach to Collaborative Filtering, Proceedings of the Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 201-212, ACM Press, August 1999.

  4. W. Aiello, F. Chung, and L. Lu, A Random Graph Model for Massive Graphs, Proceedings of the ACM Symposium on Theory of Computing (STOC'2000), pages 171-180, ACM Press, 2000.

  5. L.A.N. Amaral, A. Scala, M. Barthelemy, and H.E. Stanley, Classes of Behavior of Small-World Networks, cond-mat/0001458, January 2000.

  6. B. Amento, L. Terveen, and W. Hill, Does `Authority' mean Quality? Predicting Expert Quality Ratings of Web Documents, In Proceedings of the 23rd SIGIR, 2000.

  7. C. Anderson, A. Levy, and D. Weld, Web-Site Management with Tiramisu, In Proceedings of the Web/DB Workshop, SIGMOD 1999, 1999.

  8. N. Ashish and C. Knoblock, Wrapper Generation for Semi-Structured Internet Sources, ACM SIGMOD Record, December 1997.

  9. M. Balabanovic and Y. Shoham, Fab: Content-Based, Collaborative Recommendation, Communications of the ACM, Vol. 40, No. 3, pages 66-72, March 1997.

  10. N.J. Belkin, Helping People Find What They Don't Know, Communications of the ACM, Vol. 43, No. 8, pages 59-61, August 2000.

  11. N.J. Belkin and W.B. Croft, Information Filtering and Information Retrieval: Two Sides of the Same Coin?, Communications of the ACM, Vol. 35, No. 12, pages 29-38, December 1992.

  12. M.W. Berry, Z. Drmac, and E.R. Jessup, Matrices, Vector Spaces, and Information Retrieval, SIAM Review, Vol. 41, No. 2, pages 335-362, 1999.

  13. M.W. Berry, S.T. Dumais, and G.W. O'Brien, Using Linear Algebra for Intelligent Information Retrieval, SIAM Review, Vol. 37, No 4, pages 573-595, 1995.

  14. D. Billsus, and M. Pazzani, Learning Collaborative Information Filters, Proceedings of the Fifteenth International Conference on Machine Learning, pages 46-53, Morgan Kaufmann, 1998.

  15. D. Binkley and K. Gallagher, Program Slicing, Advances in Computers, Vol. 43, 1996.

  16. A. Booker et al., Visualizing Text Datasets, IEEE Computing in Science and Engineering, Vol. 1, No. 4, pages 26-34, July/August 1999.

  17. J. Breese, D. Heckerman, and C. Kadie, Empirical Analysis of Predictive Algorithms for Collaborative Filtering, Proceedings of the Fourteenth Annual Conference on Uncertainty in Artificial Intelligence, pages 43-52, Morgan Kaufmann, July 1998.

  18. S. Brin and L. Page, The Anatomy of a Large-Scale Hypertextual Web Search Engine, Proceedings of the Seventh International World Wide Web Conference, pages 107-117, April 1998.

  19. A. Broder et al., Graph Structure in the Web, In Proceedings of the International World Wide Web Conference, 1999.

  20. J. Callan, Searching for Needles in a World of Haystacks, IEEE Data Engineering Bulletin, Volume 23, Number 3, pages 33-67, September 2000.

  21. S. Chakraborti, B.E. Dom, S. Ravi Kumar, P. Raghavan, S. Rajagopalan, A. Tomkins, D. Gibson, and J. Kleinberg, Mining the Web's Link Structure, IEEE Computer, Vol. 32, No. 8, pages 60-67, August 1999.

  22. W.W. Cohen, A. McCallum, and D. Quass, Learning to Understand the Web, IEEE Data Engineering Bulletin, Volume 23, Number 3, pages 17-24, September 2000.

  23. M. Fernandez, D. Florescu, J. Kang, A. Levy, and D. Suciu, Catching the Boat with Strudel: Experience with a Web-Site Management System, Proc. ACM SIGMOD, 1998.

  24. M. Faloutsos, P. Faloutsos, and C. Faloutsos, On Power-Law Relationships of the Internet Topology, Proceedings of the ACM SIGCOMM Conference on Applications, Technologies, Architectures, and Protocols for Computer Communication, pages 251-262, September 1999.

  25. G.W. Flake, S. Lawrence and C. Lee Giles, Efficient Identification of Web Communities, Proceedings of the Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 150-160, ACM Press, August 2000.

  26. D. Florescu, A. Levy, and A. Mendelzon, Database Techniques for the World-Wide Web: A Survey, ACM SIGMOD Record, Vol. 27, No. 3, pages 59-74, September 1998.

  27. P.W. Foltz and S.T. Dumais, Personalized Information Delivery: An Analysis of Information Filtering Methods, Communications of the ACM, Vol. 35, No. 12, pages 51-60, 1992.

  28. Y. Fruend, R. Iyer, R. Schapire, and Y. Singer, An Efficient Boosting Algorithm for Combining Preferences, Proceedings of the Fifteenth International Conference on Machine Learning, pages 170-178, Morgan Kaufmann, July 1998.

  29. M. Garofalakis, A. Gionis, R. Rastogi, S. Seshadri, and K. Shim, XTRACT: A System for Extracting Document Type Descriptors from XML Documents, Proc. ACM SIGMOD, 2000.

  30. D. Gibson, J. Kleinberg, and P. Raghavan, Clustering Categorical Data: An Approach Based on Dynamical Systems, In Proceedings of VLDB'1998, pages 311-312, 1998.

  31. C.H. Goh, S. Bressan, S. Madnick, and M. Siegel, Context Interchange: New Features and Formalisms for the Intelligent Integration of Information, ACM Transactions on Information Systems, Vol. 17, No. 3, pages 270-293, 1999.

  32. D. Goldberg, D. Nichols, B. Oki, and D. Terry, Using Collaborative Filtering to Weave an Information Tapestry, Communications of the ACM, Vol. 35, No. 12, pages 61-70, December 1992.

  33. K. Goldberg, Jester: The On-Line Joke Recommender, URL: http://shadow.ieor.berkeley.edu/humor.

  34. K. Goldberg, T. Roeder, D. Gupta, and C. Perkins, Eigentaste: A Constant Time Collaborative Filtering Algorithm, Technical Report M00/41, Electronic Research Laboratory, University of California, Berkeley, August 2000.

  35. N. Good, J. Schafer, J. Konstan, A. Borchers, B. Sarwar, J. Herlocker, and J. Riedl, Combining Collaborative Filtering with Personal Agents for Better Recommendations, Proceedings of the Sixteenth National Conference on Artificial Intelligence, pages 439-446, AAAI/MIT Press, July 1999.

  36. P. Gray, P. King, and L. Kerschberg, Functional Approach to Intelligent Information Systems, Journal of Intelligent Information Systems, 2000.

  37. B. Hayes, Graph Theory in Practice: Part I, American Scientist, Vol. 88, No. 1, pages 9-13, 2000.

  38. B. Hayes, Graph Theory in Practice: Part II, American Scientist, Vol. 88, No. 2, pages 104-109, 2000.

  39. D. Heckerman, D.M. Chickering, C. Meek, R. Rounthwaite, and C. Kadie, Dependency Networks for Inference, Collaborative Filtering, and Data Visualization, Journal of Machine Learning Research, Vol. 1, pages 49-75, October 2000.

  40. M.R. Henzinger, Link Analysis in Web Information Retrieval, IEEE Data Engineering Bulletin, Volume 23, Number 3, pages 3-8, September 2000.

  41. T. Hoffman and J. Puzicha, Latent Class Models for Collaborative Filtering, Proceedings of the 16th International Joint Conference on Artificial Intelligence, 1999.

  42. E. Housman and E. Kaskela, State of the Art in Selective Dissemination of Information, IEEE Transactions of Engineering Writing and Speech (this is now called IEEE Transactions on Professional Communications; look this article up in the wet library), III, 2, 1970.

  43. B.A. Huberman, P. Pirolli, J. Pitkow, and R.J. Lukose, Strong Regularities in World Wide Web Surfing, Science, Vol. 280, pages 95-97, 1998.

  44. A. Joshi, On Proxy Agents, Mobility, and Web Access, ACM Baltzer Journal of Mobile Networks and Applications (MONET), 2000.

  45. F. Jiang and M.L. Littman, Approximate Dimension Equalization in Vector-Based Information Retrieval, In Proceedings of the Seventeenth International Conference on Machine Learning, 2000.

  46. J. Karat, C.-M. Karat, and J. Ukelson, Affordances, Motivation, and the Design of User Interfaces, Communications of the ACM, Vol. 43, No. 8, pages 63-65, 2000.

  47. H. Kautz, B. Selman, and M. Shah, ReferralWeb: Combining Social Networks and Collaborative Filtering, Communications of the ACM, Vol. 40, No. 3, pages 63-65, March 1997.

  48. B. Kitts, D. Freed, and M. Vrieze, Cross-Sell: A Fast Promotion-Tunable Customer-Item Recommendation Method Based on Conditional Independent Probabilities, Proceedings of the Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 437-446, ACM Press, August 2000.

  49. J. Kleinberg, Authoritative Sources in a Hyperlinked Environment, Journal of the ACM, Vol. 46, No. 5, pages 604-632, September 2000.

  50. J. Kleinberg, The Small-World Phenomenon: An Algorithmic Perspective, Nature, 2000.

  51. T.G. Kolda, and D.P. O'Leary, A Semidiscrete Matrix Decomposition for Latent Semantic Indexing in Information Retrieval, ACM Transactions on Information Systems, Vol. 16, No. 4, pages 322-346, 1998.

  52. J. Konstan, B. Miller, D. Maltz, J. Herlocker, L. Gordan, and J. Riedl, Grouplens: Applying Collaborative Filtering to Usenet News, Communications of the ACM, Vol. 40, No. 3, pages 77-87, March 1997.

  53. C. Knoblock et al., Modeling Web Resources for Information Integration, AAAI'98, 1998.

  54. S.R. Kumar, P. Raghavan, S. Rajagopalan, and A. Tomkins, Trawling the Web for Emerging Cyber-Communities, Proceedings of the Eighth World Wide Web Conference, 1999.

  55. N. Kushmerick, D.S. Weld, and R.B. Doorenbos, Wrapper Induction for Information Extraction, In Proceedings of IJCAI'97, pages 729-737, 1997.

  56. S. Lawrence, Context in Web Search, IEEE Data Engineering Bulletin, Volume 23, Number 3, pages 25-32, September 2000.

  57. S. Lawrence and C. Lee Giles, Searching the World Wide Web, Science, Vol. 280, No. 5360, pages 98-100, 1998.

  58. D. D. Lee and H. S. Seung. Learning the Parts of Objects by Non-Negative Matrix Factorization, Nature, Vol 401, pages 788-791, 1999. Not available electronically (I think), will hand out copies in class.

  59. A.Y. Levy and D.S. Weld, Introduction to Intelligent Internet Systems, Artificial Intelligence, Vol. 118, No. 1-2, pages 1-14, April 2000. The issue also contains a whole bunch of articles, mostly expanded versions of conference papers.

  60. U. Manber, A. Patel, and J. Robison, The Business of Personalization: Experience with Personalization of Yahoo!, Communications of the ACM, Vol. 43, No. 8, pages 35-39, August 2000.

  61. A.O. Mendelzon, D. Rafiei, What do the Neighbours Think? Computing Web Page Reputations, IEEE Data Engineering Bulletin, Volume 23, Number 3, pages 9-16, September 2000.

  62. R.C. Miller and B.A. Myers, Integrating a Command Shell Into a Web Browser, In Proceedings of the USENIX 2000 Annual Technical Conference, San Diego, CA, pages 171-182, June 2000.

  63. B. Mobasher, R. Cooley, and J. Srivastava, Automatic Personalization Based on Web Usage Mining, Communications of the ACM, Vol. 43, No. 8, pages 142-151, August 2000.

  64. S. Nestorov, S. Abiteboul, and R. Motwani, Extracting Schema from Semistructured Data, In Proc. ACM SIGMOD, 1998.

  65. M.E.J. Newman, The Structure of Scientific Collaboration Networks, Technical Report 00-07-037, Santa Fe Institute, 2000.

  66. M.E.J. Newman, S. Strogatz, and D. Watts, Random Graphs with Arbitrary Degree Distribution and their Applications, Technical Report 00-07-042, Santa Fe Institute, 2000.

  67. D. Payton, Discovering Collaborators by Analyzing Trails through an Information Space, Proceedings of the AAAI Fall Symposium on Artificial Intelligence and Link Analysis, pages 84-87, October 1998.

  68. C. Papadimitriou, P. Raghavan, H. Tamaki, and S. Vempala, Latent Semantic Indexing: A Probabilistic Analysis, In Proceedings of PODS'98, 1998.

  69. M. Pazzani, K. Muramatsu, and D. Billsus, Syskill and Webert: Identifying Interesting Web Sites, In Proceedings of the Thirteenth National Conference on Artificial Intelligence, pages 54-61, Portland, OR, August 1996.

  70. M. Perkowitz and O. Etzioni, Adaptive Web Sites, Communications of the ACM, Vol. 42, No. 8, pages 152-158, 2000.

  71. T.A. Phelps and R. Wilensky, Robust Intra-Document Locations, In Proceedings of the 9th World Wide Web Conference, 1999.

  72. B.J. Pine, S. Davis, and B.J. Pine II, Mass Customization, Harvard Business School Press, Boston, MA, April 1999.

  73. P. Pirolli, J. Pitkow, and R. Rao, Silk from a Sow's Ear: Extracting Usable Structures from the Web, In Proc. CHI'96, 1996.

  74. N. Ramakrishnan, PIPE: Web Personalization by Partial Evaluation, IEEE Internet Computing, Vol. 4, No. 6, pages 21-31, Nov/Dec 2000.

  75. P. Resnick and H. Varian, Recommender Systems, Communications of the ACM, Vol. 40, No. 3, pages 56-58, March 1997.

  76. M.B. Rosson, Integrating Development of Task and Object Models, Communications of the ACM, Vol. 42, No. 1, pages 49-56, 1999.

  77. J. Rucker and J. Marcos, Siteseer: Personalized Navigation for the Web, Communications of the ACM, Vol. 40, No. 3, pages 73-76, March 1997.

  78. D. Rus and D. Subramanian, Customizing Information Capture and Access, ACM Transactions on Information Systems, Vol. 15, No. 1, pages 67-101, 1997.

  79. U. Sharadanand and P. Maes, Social Information Filtering: Algorithms for Automating "Word of Mouth", Proceedings of CHI'95 - Human Factors in Computing Systems, pages 210-217, May 1995.

  80. B. Schneiderman, Designing Information-Abundant Web Sites: Issues and Recommendations, International Journal of Human-Computer Studies, Vol. 47, No. 1, 1997.

  81. C. Shapiro and H. Varian, Information Rules: A Strategic Guide to the Network Economy, Harvard Business School Press, Boston, MA, November 1998.

  82. P. Shulam, From Muhammad Ali to Grandma Rose, Discover, pages 85-89, December 1998.

  83. M.F. Shwartz and D.C.M. Wood, Discovering Shared Interests Using Graph Analysis, Communications of the ACM, Vol. 36, No. 8, pages 78-89, August 1993.

  84. M. Spiliopoulou, Web Usage Mining for Web Site Evaluation, Communications of the ACM, Vol. 43, No. 8, pages 127-134, August 2000.

  85. G.W. Stewart, The Decompositional Approach to Matrix Computation, IEEE/AIP Computing in Science and Engineering, Vol. 2, No. 1, pages 50-59, January/February 2000.

  86. L. Terveen, W. Hill, B. Amento, D. McDonald, J. Creter, PHOAKS: A System for Sharing Recommendations, Communications of the ACM, Vol. 40, No. 3, pages 59-62, March 1997.

  87. L. Terveen, W. Hill, and B. Amento, Constructing, Organizing, and Visualizing Collections of Topically Related Web Resources, ACM Transactions on Computer-Human Interaction, Vol. 6, No. 1, pages 67-94, March 1999.

  88. S. Wasserman and K. Faust, Social Network Analysis: Methods and Applications, Cambridge University Press, New York, 1994.

  89. D. Watts and S. Strogatz, Collective Dynamics of "Small-World" Networks, Nature, Vol. 393, No. 6, pages 440-442, June 1998. Again, Nature articles cannot be posted on the web, I think.

  90. A. Wexelblat and P. Maes, Footprints: History-Rich Web Browsing, In Proceedings of the Conference on Computer-Assisted Information Retrieval (RIAO), pages 75-84, 1997.

  91. X. Zhu, J. Yu, and J. Doyle, Heavy-Tailed Distributions, Generalized Source Coding and Optimal Web Layout Design, In Proceedings of INFOCOMM'2001, 2001.