[1] Ricardo Baeza-Yates and Carlos Castillo. Caracterizando la web chilena. In Encuentro chileno de ciencias de la computación, Punta Arenas, Chile, 2000. Sociedad Chilena de Ciencias de la Computación.
[2] Ricardo Baeza-Yates and Carlos Castillo. Relating web characteristics with link based web page ranking. In Proceedings of String Processing and Information Retrieval SPIRE, pages 21–32, Laguna San Rafael, Chile, 2001. IEEE CS Press.
[3] Ricardo Baeza-Yates and Carlos Castillo. Características de la web chilena 2004. Technical report, Center for Web Research, University of Chile, 2005.
[4] Ricardo Baeza-Yates and Carlos Castillo. Relationship between web links and trade. Proceedings of the 15th international conference on World Wide Web, pages 927–928, 2006.
[5] Ricardo Baeza-Yates and Carlos Castillo. WIRE: Web Information Retrieval Environment, 2006. http://www.cwr.cl/projects/WIRE/.
[6] Ricardo Baeza-Yates, Carlos Castillo, and Eduardo Graells. Características de la web chilena 2006. Technical report, Center for Web Research, University of Chile, 2007.
[7] Ricardo Baeza-Yates, Carlos Castillo, and Vicente López. Características de la web de españa. El Profesional de la Información, 15(1), January 2006.
[8] Ricardo Baeza-Yates and Felipe Lalanne. Characteristics of the korean web. Technical report, Korea–Chile IT Cooperation Center ITCC, 2004.
[9] Ricardo Baeza-Yates, Bárbara Poblete, and Felipe Saint-Jean. Evolución de la web chilena 2001–2002. Technical report, Center for Web Research, University of Chile, 2003.
[10] Albert-László Barabási. Linked: The New Science of Networks. Perseus Books Group, May 2002.
[11] A.A. Benczur, K. Csalogany, D. Fogaras, E. Friedman, T. Sarlos, M. Uher, and E. Windhager. Searching a small national domain–a preliminary report. Poster Proceedings of Conference on World Wide Web, 2003.
[12] T. Berners-Lee, L. Masinter, and M. McCahill. RFC1738: Uniform Resource Locators (URL). Internet RFCs, 1994.
[13] P. Boldi, B. Codenotti, M. Santini, and S. Vigna. Structural properties of the African web. The Eleventh International WWW Conference, May, 2002.
[14] A. Broder, R. Kumar, F. Maghoul, P. Raghavan, S. Rajagopalan, R. Stata, A. Tomkins, and J. Wiener. Graph structure in the web: experiments and models. Proceedings of the ninth WWW Conference, 2000.
[15] Carlos Castillo, Bartłomiej Starosta, and Marcin Sydow. Crawl.pl: Measuring statistical and structural properties of the polish web. Studia Informatica, 1(8):43–73, 2007.
[16] J. Cho, N. Shivakumar, and H. Garcia-Molina. Finding replicated web collections. ACM SIGMOD, pages 355–366, 1999.
[17] Internet Systems Consortium. Internet Domain Survey, 2007. http://www.isc.org/ds/.
[18] Brian D. Davison. Topical locality in the web. In SIGIR ’00: Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval, pages 272–279, New York, NY, USA, 2000. ACM Press.
[19] S. Dill, R. Kumar, K.S. McCurley, S. Rajagopalan, D. Sivakumar, and A. Tomkins. Self-Similarity In the Web. ACM Transactions on Internet Technology, 2(3):205–223, 2002.
[20] Efthimis Efthimiadis and Carlos Castillo. Charting the Greek Web. In Proceedings of the Conference of the American Society for Information Science and Technology (ASIST), Providence, Rhode Island, USA, November 2004. American Society for Information Science and Technology.
[21] D. Gomes and M.J. Silva. A characterization of the portuguese web. 3rd ECDL Workshop on Web Archives, Trondheim, Norway, 21, 2003.
[22] A. Gulli and A. Signorini. The indexable web is more than 11.5 billion pages. In WWW ’05: Special interest tracks and posters of the 14th international conference on World Wide Web, pages 902–903, New York, NY, USA, 2005. ACM Press.
[23] Z. Gyongyi and H. Garcia-Molina. Web spam taxonomy. First International Workshop on Adversarial Information Retrieval on the Web, 2005.
[24] Jon M. Kleinberg. Authoritative sources in a hyperlinked environment. Journal of the ACM, 46(5):604–632, 1999.
[25] Guowei Liu, Yong Yu, Jie Han, and Guirong Xue. China web graph measurements and evolution. In Web Technologies Research and Development (APWeb), pages 668–679, Shanghai, China, 2005. Springer Berlin / Heidelberg.
[26] Microsoft. ASP: Active Server Pages, 2006. http://msdn.microsoft.com/asp.net/.
[27] Marco Modesto, Álvaro Pereira, Nivio Ziviani, Carlos Castillo, and Ricardo Baeza-Yates. Um novo retrato da web brasileira. In Proceedings of XXXII SEMISH, pages 2005–2017, São Leopoldo, Brazil, 2005.
[28] L. Page, S. Brin, R. Motwani, and T. Winograd. The pagerank citation ranking: Bringing order to the web, 1998.
[29] G. Pandurangan, P. Raghavan, and E. Upfal. Using PageRank to Characterize Web Structure. 8th Annual International Computing and Combinatorics Conference (COCOON), pages 330–339, 2002.
[30] A. Rauber, A. Aschenbrenner, O. Witvoet, R.M. Bruckner, and M. Kaiser. Uncovering Information Hidden in Web Archives. D-Lib Magazine, 8(12):1082–9873, 2002.
[31] S. Sanguanpong, P.P. Nga, S. Keretho, Y. Poovarawan, and S. Warangrit. Measuring and analysis of the Thai World Wide Web. Proceeding of the Asia Pacific Advance Network conference, pages 225–230, 2000.
[32] T. Suel and J. Yuan. Compressing the graph structure of the web. Data Compression Conference (DCC), pages 213–222, 2001.
[33] M. Thelwall and D. Wilkinson. Graph structure in three national academic Webs: Power laws with anomalies. Journal of the American Society for Information Science and Technology, 54(8):706–712, 2003.
[34] Gabriel Tolosa, Fernando Bordignon, Ricardo Baeza-Yates, and Carlos Castillo. Characterization of the argentinian web. Cybermetrics, 11(1):3+, July 2007.
[35] Gabriel H. Tolosa, Fernando R. Bordignon, and Pablo J. Lavallén. Caracterización del espacio web de perú. 2006.
[36] Eveline A. Veloso, Edleno de Moura, P. Golgher, A. da Silva, R. Almeida, A. Laender, Ribeiro B. Neto, and Nivio Ziviani. Um retrato da Web Brasileira. In Proceedings of Simposio Brasileiro de Computacao, Curitiba, Brasil, 2000.
[37] George K. Zipf. Human Behavior and the Principle of Least Effort. Addison-Wesley (Reading MA), 1949.