FREDRIC C. GEY

Curriculum Vitae

Research Interests: Information Access,

Social Science Information Systems, Semantic Mapping

gey@berkeley.edu

http://ucdata.berkeley.edu/gey.html

 

EDUCATION:

University of California, Berkeley. Ph.D. in information science, 1993; M.A. in mathematics, 1964.  Harvey Mudd College. B.S. "with distinction" in mathematics, 1962.

 

EXPERIENCE:

University of California (1968-present):

UC Data Archive & Technical Assistance [UC DATA]

     (2005-present) Information Scientist

Research into multilingual information access for European and East Asian languages.  Work on syntaxes for semantic mapping between economic classification systems. Workshop co-chair (with the National Institute of Informatics of Tokyo and the Swedish Institute of Computer Science) for the 2009 workshop Information Access in a Multilingual World. http://www.sics.se/events/clir2009. Workshop co-chair for Social Science Data Librarianship, a University Curriculum at IASSIST 2008, Stanford University:

https://www.stanford.edu/group/ADS/cgi-bin/drupal/social-science-data-librarianship-a-university-curriculum

 Designed prototype geographic visual interfaces to California and USA statistical information (see web page for URLs).

 

(Summer 2010, JAPAN) Visiting Researcher, National Institute of Informatics, Tokyo, Japan.  Developed evaluation process for specialized geographic and temporal search queries (e.g. “When and where did George Kennan die?”).  Resulted in two refereed papers in computer science workshops (see publications)

(Fall 2008, GERMANY) Visiting Scholar, Department of Information Systems, University of Hildesheim, northern Germany.  Presented lectures on Social Science and Digital Humanities subjects at the University, and on fuzzy search across languages at the German Information Retrieval Conference, Universität Würzburg.  Lectured, consulted with and advised PhD students on research topics.

(Summer 2007, JAPAN) Visiting Researcher, National Institute of Informatics, Tokyo, Japan.  Developed Japanese-English bilingual lexicon of technical terms (see publications).

 

     (1989-2005):  Assistant Director, UC DATA (retired June 2005)

Co-manager of social science and health statistics databases for the UC Berkeley Campus.  Metadata consultant for unique online digital libraries of Census data available on the web PI and project manager for preservation and archiving of historical census data http://ucdata.berkeley.edu/projects/censusrescue

. Member of the Expert committee of the DDI Alliance, a consortium of social science data archives working on a common standard for XML representation and archiving of numeric Social Science data..

Lawrence Berkeley Laboratory (1968-1989):  Staff  Scientist, Computer Science Division

      Principal investigator and project manager for numerous multi-million dollar, multi-year government software information systems research and development projects.

Bell Laboratories (1964-1967):  Member of Technical  Staff, Analytical Mechanics Department.

 

 GRANTS/CONTRACTS as PI, Co-PI or Senior Investigator since 1996 ($3.616 million total as PI or Co-PI):

 

NEH (National Endowment for Humanities) (with IMLS)

     2007-2010 Context and Relationships: Ireland and Irish Studies                      $350,000 (co-PI)

IMLS (Institute of Museum and Library Services) National Leadership Grant for Libraries,

     2006-2009 Bringing Lives to Light: Biography in Context                           $398,000 (co-PI)

     2004-2006: Support for the Learner: What, Where, When, and Why           $240,000 (co-PI)

     2002-2004  Going Places in the Catalog: Improved Geographic Access    $242,000 (co-PI)

     1999-2002: Seamless Searching of Numeric and Textual Resources            $243,000 (co-PI)

National Science Foundation (Computer & Information Science and Engineering Division)

     2006-2010, Semantics Management and Semantics Service (Eco-informatics Project),

    Berkeley Water Center                                                                           $1M (Senior Investigator)

    1996-2000, Probabilistic Retrieval Using the Method of Logistic Regression $300,000 (PI).

 

DARPA TIDES program: and Information Management Program

    1999-2004 Translingual Information Management Using Domain Ontologies $430,000 (co-PI)

    1997-2001, Search Support for Unfamiliar Metadata Vocabularies ,             $1,147,000 (co-PI)

University of California: California Digital Library

   2002-2004: Historical Census Rescue Project                                                      $196,000 (PI)

University of California: California Policy Research Center

   2000-2001 Impact of Welfare Reform on Agricultural and Rural California Counties $40,000 (co-PI).

University of California  Institute for Mexico and the United States

   2002-2004: California Latino Demographic Data Book, Third Edition                 $ 30,000 (PI).

 

SELECTED RECENT PUBLICATIONS (since 1999):

 

Gey11a            F Gey, “Syntaxes for Mapping Between Classification Systems,” Lawrence Berkeley Laboratory Report, forthcoming

Gey11b            F Gey, “Dynamic Change in Classification Systems: A Preliminary Model for Instance-based Matching,” Lawrence Berkeley Laboratory Report, forthcoming, currently available at: http://metadata.berkeley.edu/gey/publications/gey-dynamic-change-classification-systems-model.pdf.

GeKaLa10b     F. Gey, N. Kando, and R. Larson. “The Crucial Role of Semantic Discovery and Markup in Geo-temporal Search.” In Proceedings of the Third Workshop on Exploiting Semantic Annotations in Information Retrieval (ESAIR), Toronto, October 26, 2010.

GeKaLa10a     F Gey, N Kando  and R Larson, “An Evaluation of Geographic and Temporal Search,” in Proceedings of workshop LWA2010 - Lernen, Wissen, Adaptivität. 4.-6. Oktober 2010, Kassel, Germany (available at http://www.kde.cs.uni-kassel.de/conf/lwa10/papers/ir7.pdf)

SaEtAl10         Diana Santos ... Fredric C. Gey, et al, “GikiCLEF: Crosscultural Issues in Multilingual Information Access.” Proceedings of LREC 2010, Language Resources and Evaluation Conference, Malta, May 17-23, 2010.

McCetal09       J McCarthy, et al (F Gey), “Data Modeling and Harmonization with OWL: Opportunities and Lessons Learned,” Proceedings of the Fifth International Workshop on Semantic Web Enabled Software Engineering, Virginia, USA Oct 26, 2009.

Gey09              F Gey, “Romanization – An Untapped Resource for Out-of-Vocabulary Machine Translation for CLIR,” Proceedings of the SIGIR 2009 Workshop: Information Access in a Multilingual World, Boston, July 23, 2009, pp 49-51.

GeKaKa09      F Gey, N Kando and J Karlgren, editors, Proceedings of the SIGIR 2009 Workshop: Information Access in a Multilingual World: Transitioning from Research to Real-World Applications, Boston, July 23, 2009, 61pp.

GeyLa08          F Gey and Larson, R, “Relevance Levels in Patent Mining,” Proceedings of the 2nd International  Workshop on Evaluating Information Access, Tokyo, Japan, Dec 16, 2008, pp 57-59.

LaGey08          R Larson and F Gey, “Patent Mining: A Baseline Approach,” Proceedings of the 7th NTCIR Workshop on Evaluation of Asian Language Information Access Technologies, ISBN: 978-4-86049-044-7, Tokyo, Japan, Dec 16-19, 2008, pp 325-361.

GeEtal08b        Gey, F, R Shaw, R Larson and B Pateman, “Biography as Events in Time and Space” Proceedings of ACM Geographic Information Systems (GIS) 2008 Conference, Irvine, California, Nov 5-7, 2008.

GeEtal08a       F Gey, R Shaw, R Larson, M Buckland, B Pateman and D Melia,  “Marking Up Cultural Materials for Time and Geography” European Conference on Digital Libraries (ECDL) Workshop on Intelligent Access to Cultural Heritage," Aarhus, Denmark, September 18, 2008.

GeEvKa08       Gey, F, DK Evans and N Kando, “A Japanese-English Technical Lexicon for Translation and Language Research,” Proceedings of the Language Resources and Evaluation Conference (LREC 2008), Morocco, May 27-30, 2008 (Springer proceedings).

MaGeEtal08     Mandl, T, F Gey et al, “An Evaluation Resource for Geographic Information Retrieval,” Proceedings of the Language Resources and Evaluation Conference (LREC 2008), Morocco, May 27-30, 2008 (Springer proceedings).

Gey07              Gey, F, “Search Between Chinese and Japanese Text Collections,” Proceedings of the 6th NTCIR Workshop on Evaluation of Asian Language Information Access Technologies, ISBN: 978-4-86049-039-3, Tokyo, Japan, May 16-18, 2007, pp 73-76 .

GeyEtal07b      Gey, F et al, “Challenges to Multilingual Geographic Information Retrieval in GeoCLEF,” Proceedings of the First International Workshop on Evaluation of Information Access (EVIA), ISBN: 978-4-86049-040-9, Tokyo Japan, May 15, 2007, pp 74-77. 

BuEtal07          M Buckland, A Chen, F Gey, R Larson, R Mostern & V Petras 2007‚”Geographic Search: Catalogs, Gazetteers, and Maps.” College & Research Libraries, Vol 68, no. 5 (Sept 2007): pp 376-387.

GeyEtal07a      Gey, F et al, “GeoCLEF 2006: the CLEF 2006 Cross-Language Geographic Information Retrieval Track OverviewEvaluation of Multilingual and Multi-Modal Information Retrieval, Revised Selected Papers from the CLEF 2006 Workshop, Alicante, Spain, Springer Lecture Notes in Computer Science,  LCNS 4730, Summer 2007, pp 852-876.

BuChGeLa06   M Buckland, A Chen, F Gey & R Larson, 2006. “Search Across Different Media: Numeric Data Sets and Text Files.” Information Technology and Libraries. December 2006, pp 181-189.

Gey06              Gey, F, “Real-World Understanding for Multilingual Statistical Tables,” Proceedings of the SIGIR-2006 Workshop on New Directions in Multilingual Information Access, Seattle, August 2006, pp 19-21

OlGe06            Olken, F and F Gey, “Social Science Data Librarianship: A University Curriculum,” Presented at IASSIST 2006, Conference of International Association of Social Science Information Science and Technology, Ann Arbor May 2006, available at http://hpcrd.lbl.gov/staff/olken/ssdl/iassist_ssdl_curriculum.pdf.

Gey 05b           Gey, F, “How Similar are Chinese and Japanese for Cross-Language Information Retrieval?”  Proceedings of the 5th NTCIR Workshop on Evaluation of Asian Language Information Access Technologies, ISBN: 4-86049-033-9, Tokyo, Japan, December 2005, pp 171-174.

Gey 05a           Gey, F, “Searching a Russian Document Collection using English, Chinese and Japanese Queries,” in Multilingual Information Access for Text, Speech and Images, Springer Computer Science Series LNCS 3491 / 2005, pp.200-206.

GePeKa 05      Gey, F, N Kando and C Peters, “Cross-language Information Retrieval, the Way Ahead”,  introduction to the special issue on Cross-Language Information Retrieval, edited by Gey, Kando and Peters, Information Processing & Management, Volume 41, Issue 3, May 2005, Pages 415-431. (http://www.sciencedirect.com/science/article/B6VC8-4F4B2B7-3/2/be49ddaa50d2135bb60248a3f7c36d8f)

GeJaStEi 04    Gey, F, C Jiang,  J Stiles and I Einowski, California Latino Demographic Data Book, Third Edition (web edition only, 220 pages), December 2004 available at http://ucdata.berkeley.edu/new_web/projects/latino3/Databook2004.pdf.

ChGe 04          Chen, A and F Gey, “Multilingual Information Retrieval Using Machine Translation, Relevance Feedback and Decompounding,” Information Retrieval. V. 7(1-2): 149-182 (2004).

BrSpGeWi 04 Brady, H, M Sprague, F Gey and M Wiseman, Seasonal Employment and Welfare Use in California’s Agricultural and Rural counties,  California Policy Research Center Report, June 2004,  24 pages, available at http://www.ucop.edu/cprc/seasonal.pdf.

Ge 04               F Gey, Chinese and Korean Topic Search of Japanese News Collections, Working Notes of the Fourth NTCIR Workshop on Asian Language Retrieval and Question Answering, Tokyo Japan, June 2004, pp 214-218.

ChGe04           Chen, A and F Gey, “Combining Query Translation and Document Translation in Cross-Language Retrieval” in Comparative Evaluation of Multilingual Information Access Systems: 4th Workshop of the Cross-Language Evaluation Forum, Springer Computer Science Series LNCS 3237 / 2004, pp. 108-121.

PePeGe 04       Petras, V, N Perelman and F Gey, “UC Berkeley at CLEF-2003 – Russian Language Experiments and Domain-Specific Cross-Language Retrieval,” in Comparative Evaluation of Multilingual Information Access Systems: 4th Workshop of the Cross-Language Evaluation Forum, Springer Computer Science Series LNCS 3237 / 2004, pp.401-411.

ChGe03           Chen, A and F Gey, “Building an Arabic Stemmer for Information Retrieval,” The Eleventh Text Retrieval Conference, TREC-2002, NIST Special Publication 500-251, May 2003, pp. 631-639. (see http://trec.nist.gov/)

OaGe03           Oard, D and F Gey, “The TREC-2002 Arabic-English CLIR Track,” The Eleventh Text Retrieval Conference, TREC-2002, NIST Special Publication 500-251, May 2003, pp. 17-26.

GeChJa 03        Gey, F, A Chen and H Jiang, “Multilingual Information Retrieval,” entry in Encyclopedia of Library and Information Science,  Second Edition Revised and Expanded, Marcel Dekker, Inc.,  Spring 2003. pp 1895-1905.

FrPoGePe 02 James C. French, Allison L. Powell, Fredric C. Gey, Natalia Perelman: Exploiting Manual Indexing to Improve Collection Selection and Retrieval Effectiveness. Information Retrieval. V. 5(4): (2002), pp 323-351

Gey 02              Gey, F, “Prospects for Machine Translation of the Tamil Language,” in Proceedings of Tamil Internet 2002, K Kalyanasundaram, Ed., International Forum for Information Technology in Tamil, San Francisco, September 2002, pp 221-225.

GeJaPe02          Gey, F, H Jiang and N Perelman, “Working with Russian Queries for the GIRT, Bilingual and Multilingual Tasks at CLEF”, Evaluation of Cross-Language Information Retrieval Systems, Springer Computer Science Series LNCS 2406 , 2002,  pp 235-243.

LaGeCh 02          Larson, R, F Gey and A Chen, “Harvesting Translingual Vocabulary Mappings for Multilingual Digital Libraries,” in Proceedings of the Joint Conference on Digital Library Research, Portland, Oregon, July 2002, pp 185-190. 

 

BrSpGe 02    Brady, H, M Sprague, F Gey and M Wiseman, “Seasonal Employment Dynamics and Welfare Use in Rural and Agricultural California Counties,” in Rural Dimensions of Welfare Reform, Weber, Duncan and Whitener Eds, Upjohn Institute for Employment Research, 2002, pp 147-175.

GeChLaBu 02    Gey, F,  A Chen, R Larson and M Buckland, “Translingual vocabulary mappings for multilingual information access”, Proceedings ACM SIGIR-2002, Tampere, Finland, August 2002, pp 455-456.

PoFrGePe 01    Powell, A, J French, F Gey and N Perelman,  “Exploiting a Controlled Vocabulary to Improve Retrieval Effectiveness” Proceedings of ACM CIKM-2001, Tenth International Conference on Information and Knowledge Management, Atlanta, Georgia, November 2001, pp 199-206.

GeBuChLa 01  Gey, F, M Buckland, A Chen, and R Larson, “Entry Vocabulary – A Technology to Enhance Digital Search,” Proceedings of HLT-2001, First International Conference on Human Language Technology, March 18-21, 2001, San Diego, California, pp 91-95.

GeChJiKi 00      Fredric Gey, Aitao Chen, Hailing Jiang and Kazauki Kishida “Lessons Learned from the University of California Participation in the First NTCIR Workshop,” Japan Information Processing Society Journal, V. 41, No. 8, August 2000, pp 910-912 (in Japanese).

KiNoChGe 99   Kim, Y, B Norgard, A Chen and F Gey   “Using Ordinary Language to Access Metadata of Diverse Types of Information Resources: Trade Classification and Numeric Data” Proceedings of the 62nd Annual Meeting of the American Society for Information Science, Washington DC, Oct 31-Nov 4, 1999, pp 172-180.

 

Contact: e-mail: gey@berkeley.edu Ph.(510) 643-1298, FAX (510) 643-8292

UC Data Archive & Technical Assistance, University of California

2538 Channing Way, #5100

Berkeley CA 94720-5100

Service/Synergistic Activities: External curriculum reviewer of new graduate information technology curriculum, University of Aizu, Japan, 2009. External Masters/PhD dissertation examiner, 2007, University of New South Wales and 2006, Royal Melbourne Institute of Technology, Australia. Steering committee member for the European language search evaluation forum CLEF (organized under the European Commission DELOS program by the Italian National Research Council: http://www.clef-campaign.org/).  Minitrack Co-organizer, HICSS (Hawaii Conference on System Sciences), 2000-2007. General Chair of ACM SIGIR’99, the 22nd Annual Conference on Research and Development in Information Retrieval, UC Berkeley, August 1999.  NSF Panel reviewer: Digital Preservation 2004, Information and Data Management, 2002 and 1997.   Conference Review Committees: AIRS 2006 (Asian Information Retrieval Systems), Human Language Technologies conferences, HLT-2005&2006, SIGIR-2007, 2006, 2005, 2004, 1996, HICSS 2000-2007, CIKM 2002.

Memberships and Awards:

     Who’s Who In America, 2009. Fulbright Senior Specialists Academic Roster, 2008.

     Life member, IEEE (Institute for Electrical and Electronic Engineers), member since 1969

     Association for Computing Machinery, since 1964 – Service Award in 1999 as Chair of SIGIR 1999, the 22nd International Conference on Research and Development in Information Retrieval.  International Association for Social Science Information and Technology (IASSIST), since 1990.