Shaozhi Ye Department of Computer Science, University of California, Davis
(530) 752-3128
[email protected] http://wwwcsif.cs.ucdavis.edu/˜yeshao
Education Sep. 2005 - Present • Ph.D student, Computer Science, University of California, Davis, GPA: 3.96/4.0 Received in Jan. 2005 • M.S., Electronic Engineering, Tsinghua University, Beijing, China – Thesis: Distributed search engine link model and accelerated PageRank algorithm, advisor: Dr. Xing Li Received in Jul. 2002 • B.S., Electronic Engineering, Tsinghua University, Beijing, China – Thesis: Distributed web crawler implementation and optimization, advisor: Dr. Xing Li Research Interests • Web Search and Mining • Distributed Systems Research and Related Experience Jun. 2006 - Sep. 2006 • Software Engineer Intern, IBM Almaden Research, San Jose, CA – Benchmarked IBM General Parallel File System, performance tuning and bottleneck identification. – Gained hands on experience with GNU gprof, IBM Rational Purify, and Intel VTune. Apr. 2005 - Jun. 2005 • Research Intern, Microsoft Research Asia, Beijing, China – Investigated co-authoring and citation relationship among ACM/IEEE/DBLP/CiteSeer papers. Sep. 2002 - Jan. 2005 • Research Assistant, Compass Group, Tsinghua Univ., Beijing, China – Implemented PageRank algorithm with optimizations on data structures and computation operations. – Took the 1st place at Chinese Web Page Categorization Competition with a Rocchio classifier. Jul. 2003 - Jul. 2004 • Research Intern, Microsoft Research Asia, Beijing, China – Proposed a duplicate document detection algorithm based on search queries. – Investigated parameter correlations in large scale duplicate document detection algorithms. Jul. 2001 - Aug. 2001 • Software Engineer Intern, Kaipu Internet Information Co., China – Programed an automatic Java code template generation toolkit with J2EE. Sep. 2000 - Jul. 2002 • Research Assistant, Compass Group, Tsinghua Univ., Beijing, China – Developed a distributed web crawler and analyzed its performance bottlenecks – Monitored 1,000 IPv6 websites and analyzed their growth trends. – Developed an FTP search engine which indexed 200 FTP sites. Awards • Excellent Academic Performance Scholarship, First Prize. Tsinghua Univ, 2004. (Top 25 out of 575) • The 5th place at Named/homepage Finding task and the 8th place at Topic Distillation task in Web Track, the 12th Text Retrieval Conference (TREC), 2003. With Microsoft Research Asia. • The 1st place at Chinese Web Page Categorization Competition, the first Chinese Symposium on Search Engine and Web Mining, Beijing, China, Apr 2003. With Compass Group, Tsinghua Univ. Skills and Strengths Languages: C/C++, Perl, LATEX, Python, R, Java, awk, sed and shell script programming. Operating Systems: Linux, FreeBSD and Solaris development and administration since 1999. Standard Tests: TOEFL/TWE: 653/4.0 (Jan. 2004), GRE: V 550 (73%), Q 790 (94%), A 760 (94%) (Aug. 2002). Journal Papers 1. Shaozhi Ye, Ji-Rong Wen, and Wei-Ying Ma. A systematic study on parameter correlations in large scale duplicate document detection. Knowledge and Information Systems, Vol.14, No.2, pp 217-232, Feb. 2008. 2. Ming Jia, Jiangtao Wen, Shaozhi Ye, and Xing Li. Restricted fast MAP decoding of VLC. IEEE Communication Letters, vol.9, no.10, pp 909–911, Oct. 2005. 3. Shaozhi Ye, Hui Liu, Yue Li, Hui Huang, and Xing Li. Development of IPv6 networks viewed from the angle of search engine. In Zhongxing Telecom Technology, Vol.40, pp 1–3, 2002. (in Chinese)
4. Hui Liu, Shaozhi Ye, Hui Huang, and Xing Li. IPv6 network analysis based on search engine. In Telecommunication Science, No.3, pp 43–45, 2002. (in Chinese) Conference and Workshop Papers
∗
1. Ming Jia, Shaozhi Ye, Xing Li, and Julie Dickerson. Web Site Recommendation Using HTTP Traffic. In Proceedings of the 7th IEEE International Conference on Data Mining (ICDM’07), 2007. (19.2%) 2. Lerone Banks, Shaozhi Ye, Yue Huang, and S. Felix Wu. Davis Social Links: integrating social networks with Internet routing. In Proceedings of ACM SIGCOMM Workshop on Large-Scale Attack Defense (LSAD’07), 2007. 3. Shaozhi Ye, Ji-Rong Wen, and Wei-Ying Ma. A systematic study of parameter correlations in large scale duplicate document detection. In Proceedings of the 10th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD’06), pp 275–284, 2006. (13.4%)(Best Student Paper Nomination) 4. Liang Chen, Shaozhi Ye, and Xing Li. Template detection for large scale search engine. In Proceedings of the 21st Annual ACM Symposium on Applied Computing (SAC’06), pp 1094–1098, April 2006. 5. Yangbo Zhu, Shaozhi Ye, and Xing Li. Distributed PageRank computation based on aggregation-disaggregation methods. In Proceedings of the 14th ACM Conference on Information and Knowledge Management (CIKM’05) , pp 578–585, 2005. (18%) 6. Yi Wang, Shaozhi Ye, and Xing Li. Understanding current IPv6 performance: A measurement study. In Proceedings of the 10th IEEE Symposium on Computers and Communications (ISCC’05), pp 71–76, 2005. 7. Jingfang Xu, Shaozhi Ye, and Xing Li. Query based Chinese phrase extraction for site search. In Proceedings of the 5th International Conference on Web Information Systems Engineering (WISE’04), pp 125–134, 2004. (24%) 8. Shaozhi Ye, Guohan Lu, and Xing Li. Workload-aware web crawling and server workload detection. In Proceedings of Asia-Pacific Advanced Network Research Workshop, pp 263–269, Jul 2004. 9. Shaozhi Ye, Ruihua Song, Ji-Rong Wen, and Wei-Ying Ma. A query-dependent duplicate detection approach for large scale search engines. In Proceedings of the 6th Asia Pacific Web Conference (APWeb’04), pp 48–58, 2004. 10. Ji-Rong Wen, Ruihua Song, Deng Cai, Kaihua Zhu, Shipeng Yu, Shaozhi Ye, and Wei-Ying Ma. Microsoft Research Asia at the Web Track of TREC 2003. In Proceedings of the 12th Text Retrieval Conference (TREC’03), pp 408–417, Nov 2003 11. Yue Li, Hui Liu, Gang Zhu, Shaozhi Ye, and Xing Li. Analysis of IPv6 over search engine. In Proceedings of the 5th Joint AEARU Workshop on Web Technology and Computer Science, Oct 2003. 12. Hui Liu, Ran Peng, Shaozhi Ye, and Xing Li. An efficient centroid based Chinese web page classifier. In Proceedings of Asia-Pacific Advanced Network Research Workshop, pp 9–14, 2003. Membership and Services Executive committee member, UC Davis Chinese students and scholars fellowship Seminar organizer, Next Generation Network Lab, Tsinghua Univ Student member, China Institute of Communications Student member, Asia-Pacific Network Group (APNG) Affairs committee member, Chinese-American Networking Symposium (CANS)
Sep.2006 - present Sep 2003 - Jun 2004 Nov 2002 - Jan 2005 May 2002 - Jan 2005 Aug 2002
Reviewing Activities The The The The The
32nd International Conference on Very Large Data Bases(VLDB’06) 2006 IEEE Conference on Communications (ICC’06) 3rd Chinese Symposium on Search Engine and Web Mining (SEWM’05) 2004 IEEE/WIC/ACM International Conference on Web Intelligence (WI’04) Joint Conference of 10th Asia-Pacific Conference on Communications and 5th International Symposium on Multi-Dimensional Mobile Communications (APCC/MDMC’04)
References Available upon request.
∗ Some
conferences and workshops are very selective, where the acceptance rate is given.