Eugene Agichtein

Emory University
Mathematics and Computer Science Department
400 Dowman Drive, Suite W401
Atlanta, GA 30322
Telephone: +1-404-727-7962
eugene@mathcs.emory.edu
http://www.mathcs.emory.edu/~eugene/

Education

Professional Employment

Honors and Awards

PUBLICATIONS
http://www.mathcs.emory.edu/~eugene/publications.html

Patent

Tutorials

  1. Towards Web-Scale Information Extraction, Eugene Agichtein, webcast, ACM SIGKDD Web Seminar, March 2007
  2. Scalable Information Extraction and Integration, Eugene Agichtein and Sunita Sarawagi, presented at the ACM International Conference on Knowledge Discovery and Data Mining (KDD), 2006

Invited Papers

  1. E. Agichtein, Web Information Extraction and User Information Needs: Towards Closing the Gap, in the IEEE Data Engineering Bulletin issue on Web-Scale Data, Systems, and Semantics, December 2006
  2. E. Agichtein, Scaling Information Extraction to Large Document Collections, in the IEEE Data Engineering Bulletin issue on Searching and Mining Literature Digital Libraries, December 2005

Journal Papers

  1. S. Sahay, S. Mukherjea, E. Agichtein, E. V Garcia, S. Navathe, and A. Ram, Discovering Semantic Biomedical Relations utilizing the Web, to appear in the ACM Transactions on Knowledge Discovery from Data (TKDE), special issue on Bioinformatics, 2008
  2. P. Ipeirotis, E. Agichtein, P. Jain, and L. Gravano, Towards a Query Optimizer for Text-Centric Tasks, in ACM Transactions on Database Systems (TODS), vol. 32, no. 4, 2007
  3. E. Agichtein, S. Lawrence and L. Gravano, Learning to Find Answers to Questions on the Web, in ACM Transactions on Internet Technology (TOIT) Special Issue on "Machine Learning for the Internet", 2004
  4. H. Yu and E. Agichtein, Extracting Synonymous Gene and Protein Terms from Biological Literature, in Bioinformatics, 2003 (also in Proc. of ISMB 2003)
  5. F. M. Torres, E. Agichtein, L. Grinberg, G. Yu, and R. Q. Topper, A note on the application of the "Boltzmann simplex"-Simulated Annealing algorithm to global optimizations of argon and water clusters, Journal of Molecular Structure (THEOCHEM), 1997

Papers in Refereed Conferences

  1. Y. Liu, J. Bian, and E. Agichtein, Predicting Information Seeker Satisfaction in Community Question Answering, in Proc. of the ACM SIGIR International Conference on Research and Development in Information Retrieval (SIGIR), 2008 (17% accepted)
  2. J. Bian, Y. Liu, E. Agichtein and H. Zha. Finding the Right Facts in the Crowd: Factoid Question Answering over Social Media, in Proc. of the International World Wide Web Conference (WWW), 2008 (11% accepted)
  3. Y. Liu and E. Agichtein, You've Got Answers: Towards Personalized Models for Predicting Success in Community Question Answering (short paper), in Proc. of the Annual Meeting of the Association for Computational Linguistics (ACL), 2008 (25% accepted)
  4. B. Li, Y. Liu, and E. Agichtein, CoCQA: Co-Training Over Questions and Answers for Predicting Question Subjectivity Orientation (full paper), in Proc. of Conference on Empirical Methods in Natural Language Processing (EMNLP), 2008 (21% accepted)
  5. E. Agichtein, C. Castillo, D. Donato, A. Gionis, G. Mishne, Finding High Quality Content in Social Media, in Proc. of the ACM Web Search and Data Mining Conference (WSDM), 2008 (16% accepted)
  6. C. Clarke, E. Agichtein, S. T. Dumais, and R. W. White, The Influence of Caption Features on Clickthrough Patterns in Web Search, in Proc. of the ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2007 (18% accepted)
  7. E. Agichtein, C. Burges, and E. Brill, Question Answering over Implicitly Structured Web Content, in Proc. of the IEEE/WIC/ACM Conference on Web Intelligence (WI), 2007 (17% accepted)
  8. P. Jurczyk and E. Agichtein, Discovering Authorities in Question Answer Communities Using Link Analysis (short paper),  in Proc. of the ACM Conference on Information and Knowledge Management (CIKM), 2007 (26% accepted)
  9. E. Agichtein, E. Brill, and S. T. Dumais, Improving Web Search Ranking by Incorporating User Behavior Information, in Proc. of the ACM SIGIR Conference on Research and Development on Information Retrieval (SIGIR), 2006 (19% accepted)
  10. E. Agichtein, E. Brill, S. T. Dumais, and R. Ragno, Learning User Interaction Models for Predicting Web Search Result Preferences,  in Proc. of the ACM SIGIR Conference on Research and Development on Information Retrieval (SIGIR), 2006 (19% accepted)
  11. P. Ipeirotis, E. Agichtein, P. Jain, and L. Gravano, To Search or to Crawl: Towards a Query Optimizer for Text-Centric Tasks, in Proc. of the ACM Conference on Management of Data (SIGMOD), Best Paper Award, 2006 (13% accepted)
  12. E. Agichtein and Z. Zheng, Identifying “Best Bet” Web Search Results by Mining Past User Behavior (short paper),  in Proc. of the ACM International Conference on Knowledge Discovery and Data Mining, (KDD), Industrial Applications track, 2006 (24% accepted)
  13. E. Agichtein, Confidence Estimation Methods for Partially Supervised Relation Extraction (short paper), in Proc. of the SIAM Conference on Data Mining (SDM), 2006 (30% accepted)
  14. E. Agichtein and S. Cucerzan, Predicting Accuracy of Extracting Information from Unstructured Text Collections, in Proc. of the ACM Conference on Information and Knowledge Management (CIKM), 2005 (18% accepted)
  15. E. Agichtein and V. Ganti, Mining Reference Tables for Automatic Text Segmentation, in Proc. of the  ACM International Conference on Knowledge Discovery and Data Mining (KDD), 2004 (12% accepted)
  16. E. Eskin and E. Agichtein, Combining Text Mining and Sequence Analysis to Discover Protein Functional Regions, in Proc. of the Pacific Symposium on Biocomputing (PSB), 2004 (28% accepted)
  17. E. Agichtein and L. Gravano, Querying Text Databases for Efficient Information Extraction, in Proc. of the IEEE International Conference on Data Engineering (ICDE), Best Student Paper Award, 2003 (14% accepted)
  18. H. Yu and E. Agichtein, Extracting Synonymous Gene and Protein Terms from Biological Literature, in Proc. of the Conference on Intelligent Systems for Molecular Biology (ISMB), 2003 (15% accepted)
  19. E. Agichtein, S. Lawrence and L. Gravano, Learning Search Engine Specific Query Transformations for Question Answering,  in the 10th World Wide Web Conference (WWW), 2001 (20% accepted)
  20. E. Agichtein and L. Gravano, Snowball: Extracting Relations from Large Plain-Text Collections, in the 5th ACM International Conference on Digital Libraries (ACM DL), 2000 (33% accepted)

Papers in Refereed Workshops and Poster and Demonstration Sessions

  1. Q. Guo, E. Agichtein, C. Clarke and A. Ashkan. Understanding "Abandoned" Ads: Towards Personalized Commercial Intent Inference via Mouse Movement Analysis, in Proc. of the SIGIR 2008 Workshop on Information Retrieval in Advertising (IRA), 2008
  2. A. Ashkan, C. Clarke, E. Agichtein and Q. Guo. Characterizing Query Intent From Ad Clickthrough Data, in Proc. of the SIGIR 2008 Workshop on Information Retrieval in Advertising (IRA), 2008
  3. J. Bian, Y. Liu, E. Agichtein and H. Zha, A Few Bad Votes Too Many? Towards Robust Ranking in Social Media,  in Proc. of the WWW Workshop on Adversarial Information Retrieval (AIRWeb), 2008
  4. Q. Guo and E. Agichtein, Exploring Client-Side Instrumentation for Personalized Search Intent Inference: Preliminary Experiments,  in Proc. of the  AAAI 2008 Workshop on Intelligent Techniques for Web Personalization and Recommender Systems (ITWP), 2008
  5. B. Li, Y. Liu, A. Ram, E. V. Garcia, and E. Agichtein, Subjectivity Analysis for Questions in QA Communities (poster), in Proc. of the ACM SIGIR International Conference on Research and Development in Information Retrieval (SIGIR), 2008
  6. Y. Liu, E. Agichtein, On the Evolution of the Yahoo! Answers QA Community (poster), in Proc. of the ACM SIGIR International Conference on Research and Development in Information Retrieval (SIGIR), 2008
  7. Q. Guo, E. Agichtein, Exploring Mouse Movements for Inferring Query Intent (poster),  in Proc. of the ACM SIGIR International Conference on Research and Development in Information Retrieval, 2008
  8. P. Jurczyk and E. Agichtein. HITS on Question Answer Portals: an Exploration of Link Analysis for Author Ranking (poster),  in Proc. of the ACM SIGIR International Conference on Research and Development in Information Retrieval, 2007
  9. L. Xiong and E. Agichtein. Towards Privacy-Preserving Query Log Publishing, in Proc. of the Query Log Analysis: Social and Technological Challenges Workshop at WWW 2007
  10. S.Sahay, E. Agichtein, E.V. Garcia, B. Li, and A. Ram. Semantic Annotation and Inference for Medical Knowledge Discovery, in Proc. of NSF Symposium on Next Generation Data Mining Techniques  (NGDM), 2007
  11. E. Agichtein and S. Cucerzan, Predicting Extraction Performance by Using Context Language Models, n Proc. of the SIGIR Workshop on Methodologies and Evaluation of Lexical Cohesion Techniques in Real-World Applications (SIGIR ELECTRA), 2005
  12. E. Agichtein, S. Cucerzan, and E. Brill, Analysis of Factoid Questions for Effective Relation Extraction (poster),  in Proc. of the ACM SIGIR 2005
  13. E. Agichtein, P. Ipeirotis, and L. Gravano, Modeling Query-Based Access to Text Databases, in Proc. of the Sixth International Workshop on the Web and Databases (WebDB), 2003 (25% accepted)
  14. E. Agichtein, C.T. H. Ho, V. Josifovski, and J. Gerhardt. Extracting Relations from XML Documents, in Springer Lecture Notes in Computer Science (LNCS), Volume 2814, "Conceptual Modeling for Novel Application Domains"; also in the International Workshop on XML Schema and Data Management (XSDM), 2003
  15. E. Agichtein and Luis Gravano, QXtract: A Building Block for Efficient Information Extraction from Plain-Text Databases (demo), in the ACM International Conference on Management of Data (SIGMOD), 2003
  16. E. Agichtein, L. Gravano, J.Pavel, V. Sokolova, A. Voskoboynik. Snowball: A Prototype System for Extracting Relations from Large Text Collections (demo),  in the ACM International Conference on Management of Data (SIGMOD), 2001
  17. A. Borthwick, J. Sterling, E. Agichtein, and Ralph Grishman. Exploiting Diverse Knowledge Sources via Maximum Entropy in Named Entity Recognition, in the Sixth Workshop on Very Large Corpora, 1998
  18. E. Agichtein, E. Eskin and L. Gravano. Combining Strategies for Extracting Relations from Text Collections, in the ACM SIGMOD Workshop on Data Mining and Knowledge Discovery (DMKD, 2000  

Other Publications and Abstracts

  1. S. Sahay, B. Li, E. V. Garcia, E. Agichtein, and A. Ram,Domain Ontology Construction from Biomedical Text,  to appear in Proc. of the  2007 International Conference on Artificial Intelligence (ICAI), 2007
  2. S. Cucerzan and E. Agichtein, iFactoid Question Answering over Unstructured and Structured Content on the Web at TREC 2005, in the proceedings of the TREC 2005 conference
  3. E. Agichtein, Extracting Relations From Large Text Collections, Ph.D. Thesis, Columbia University, 2005
  4. A. Borthwick, J. Sterling, E. Agichtein, and R. Grishman, NYU: Description of the MENE Named Entity System as used in MUC-7,  in the proceedings of the 7th Message Understanding Conference (MUC-7)

Invited Talks

TEACHING EXPERIENCE

Emory University

Other Educational Activities

PROFESSIONAL SERVICE

Conference and Workshop Organization

Conference Technical Program Committee Service

Other Peer Reviewing Service

UNIVERSITY SERVICE


Last updated: August 2008.