IRList Digest Volume 4 Number 56

Published in
· 1 year ago
IRList Digest           Monday, 28 November 1988      Volume 4 : Issue 56 
      
Today's Topics: 
   Query - CD-ROMs of Value 
   Discussion - CD-ROM Network Server 
              - Pedagogical Models for IR 
   Announcement - Workshop on Evaluation of NLP Systems 
                - Staffing at NSF 
                - Times Collection for IR Research 
                - New CSLI Publications 
      
News addresses are 
   Internet: fox@vtopus.cs.vt.edu 
   BITNET: foxea@vtcc1.bitnet (replaces foxea@vtvax3) 
      
---------------------------------------------------------------------- 
      
Date:     Tue, 11 Oct 88 11:16 N 
From:     <COR_HVH@HNYKUN52> 
Subject:  CD-ROM data of value? 

      
Dear Ed, 
      
 ... 
      
CD-Rom interfaces for microcomputers have reached the point where they 
become affordable for small departments and for home use. However, is 
there already any real use for them? Which data CDs are at this moment 
available (what is it called, what is on it, is it in a standard format, 
where can you get it, what does it cost)? 
      
========= 
      
An extension to this: what data will be on the Virginia Disk(s)? 
      
Best wishes, 
      
Hans van Halteren            COR_HVH @ HNYKUN52.BITNET 

[Note: Virginia Disc One is still in process - we have added 
a Time collection from Cornell via UMass, a Rutgers 
collection from Tefko Saracevic, and have almost all of the 
LISA collection from P. Willet - are still waiting for last 
piece of that.  Regarding other CD-ROMs - I think this year 
marks the turning point in price and availability - many new 
titles coming out in High Sierra or ISO form at lower and 
lower prices. - Ed] 
      
------------------------------ 

Date:     Tue, 11 Oct 88 12:12:20 PDT 
From:     PAAAAA7@CALSTATE 
Subject:  CD Roms and Such 

Mr. Fox; 
I was just given a copy of your CD-Rom letter to Dr. Dick Botting, 
and thought I would pass along a product I just found out about. 
Meridian Data Inc. markets an interface to plug a High Sierra CD-ROM 
into a Novell Network. Sounds like something we all could use! If you 
are interested, please let me know and I will grab the address for you. 
-Rich McGee 
Computer Center 
CSUSB 

------------------------------ 

Date:    Fri, 30 Sep 88 20:11 PDT 
To:      Ed A Fox                             <FOXEA@VTCC1.BITNET> 
From:    Marcia Bates                         <IFQ0MJB@UCLAMVS.BITNET> 
Subject: Pedagogical models for IR 

[Note: see earlier discussion in issues 39, 40, 46 - Ed.] 

I just came onto IRLIST, so I do not know the text of the original question 
regarding pedagogical models. However, I have some suggestions that may be of 
interest. 

First, one must ask whether the interest is in searches only within a given 
automated system, or in the overall process of retrieving info.  Some of the 
most important questions a searcher--whether end user or expert--must decide 
in a real-life search are which system to use and whether to use a manual or 
online system.Some of the most serious errors made by librarian-students and 
by practitioners involve initiating searches on systems that are quite 
inappropriate for the query in hand--looking for statistical data on biblio- 
graphic databases, for example. 

 If these broader questions are of interest, then there is a growing 
literature both in the online area and in the area of "bibliographic 
instruction" in the library field.  IRLIST members are probably more 
familiar with the online literature than with BI. For BI, Constance 
Mellon's 1987 book Bibliographic Instruction: The Second Generation is a 
good place to start.  Some otherinteresting recent articles dealing with 
college students are: 

Stoan, Stephen K. "Research in Library Skills..."  College and Research 
   Libraries, 45 (Mar 84): 99-109. 

Dunn, Kathleen  "Psychological Needs and Source Linkages in Undergraduate 
   Information Seeking Behavior."  College and Research Libraries 47 
   (Sept 86): 475-481. 

Also book and articles by Nigel Ford, who has dealt in great detail with 
higher education students. 

If, on the other hand, interest is in automated systems primarily, or in 
the design of search interfaces, a number of my own articles deal with many 
of these issues (often dealing with both manual and online systems 
simultaneously), with models or model fragments being more or less explicit. 
See my: 

"Information Search Tactics"  Journal of ASIS 30(July 1979): 205-214. 
     Describes classes of search models, classes of tactics, and 29 
     particular search tactics. 

 "Idea Tactics"  JASIS  30 (Sept 1979): 280-289. 
     17 additional tactics. 

 "Search Techniques."  Annual Review of Information Science and Technology 
     16 (1981): 139-169. 
        Reviews lib/info sci literature on psychology of searching to that date 

 "Locating Elusive Science Information: Some Search Techniques"  Special 
     Libraries 75 (Apr 84): 114-120. 
        Drawing on model of the scientific publication cycle, the searcher 
        is shown how to locate info thought to be inaccessible. 

 "The Fallacy of the Perfect 30-item Online Search."  RQ  24 (Fall 84): 43-50. 
        Discusses psychological traps for end users and intermediaries that 
        degrade quality of online searches. 

 "An Exploratory Paradigm for Online Information Retrieval."  In Intelligent 
     Information Systems for the Information Society, ed. by B.C. Brookes. 
     Amsterdam: Elsevier, 1986, p. 91-99. 
        Model of broad classes of info seeking plus discussion of two major 
        types of search in automated systems. 

 "Subject Access in Online Catalogs: A Design Model."  JASIS 37 (Nov 86): 
     357-376. 
        Model incorporating psychological and linguistic factors in design of 
        subject search interface for online catalogs.Drawing upon recent 
        research and thinking, departs dramatically from a number of 
        traditional assumptions. 

  "How to Use Information Search Tactics Online." ONLINE 11 (May 87): 47-54. 
     Groups and applies search tactics in online environment to help at 
     several states of search, as well as in cases where too many or too few 
     items retrieved. 

  "How to Use Controlled Vocabulary More Effectively in Online Searching. 
     ONLINE, Nov. 88, in press. 
         Argues that features of various indexing and classification 
         systems used in databases must be taken into account in online 
         search formulation, i.e., different indexing systems require 
         different strategies, so searcher must be able to recognize 
         controlled vocabulary types. 

See also David Bawden, "Information Systems and the Stimulation of 
Creativity"  Journal of Info Science 12 (86): 203-216. 

                                 --Marcia J. Bates  ifq0mjb@uclamvs.bitnet 
                                   (Graduate School of Library and 
                                        Information Science 
                                    120 PLB  UCLA 
                                    Los Angeles, CA 90024-1520 USA) 

------------------------------ 

Date: Fri, 2 Sep 88 12:19:15 EDT 
From: palmer@BURDVAX.PRC.UNISYS.COM 
Subject: nl evaluation workshop 

 

                   CALL FOR PARTICIPATION 

                        Workshop on 
     Evaluation of Natural Language Processing Systems 

                          Dec 8-9 
           Wayne Hotel, Wayne, PA (Philadelphia) 

     There has been much recent interest  in  the  difficult 
problem  of  evaluating  natural language systems.  With the 
exception of natural language interfaces there are few work- 
ing systems in existence, and they tend to be concerned with 
very different tasks and use equally  different  techniques. 
There  has been little agreement in the field about training 
sets and test sets, or  about  clearly  defined  subsets  of 
problems  that  constitute standards for different levels of 
performance.  Even those groups that have attempted a  meas- 
ure of self-evaluation have often been reduced to discussing 
a system's performance in isolation - comparing its  current 
performance  to  its  previous  performance  rather  than to 
another system. As this technology  begins  to  move  slowly 
into  the  marketplace, the need for useful evaluation tech- 
niques is becoming more and more obvious.  The  speech  com- 
munity  has  made some recent progress toward developing new 
methods of evaluation, and  it  is  time  that  the  natural 
language  community followed suit.  This is much more easily 
said than done and will require a concentrated effort on the 
part of the field. 

     There are certain premises that should underly any dis- 
cussion  of  evaluation  of natural language processing sys- 
tems: 

(1)  It should be possible to  discuss   system   evaluation 
     in   general  without  having to state whether the pur- 
     pose  of the  system is  "question-answering" or  "text 
     processing."     Evaluating   a   system   requires the 
     definition of an  application  task  in  terms  of  I/O 
     pairs   which   are  equally  applicable  to  question- 
     answering, text processing, or generation. 

(2)  There are two basic types of evaluation: a) "black  box 
     evaluation"  which  measures  system  performance  on a 
     given task in terms of well-defined I/O pairs;  and  b) 
     "glass  box  evaluation"  which  examines  the internal 
     workings of the system.  For example,  glass  box  per- 
     formance   evaluation   for  a  system that is supposed 
     to  perform semantic  and  pragmatic   analysis  should 
     include the  examination  of  predicate-argument  rela- 
     tions,  referents,  and temporal and causal relations. 

     Given these premises, the workshop will  be  structured 
around  the following three sessions: 1) Defining "glass box 
evaluation" and "black box evaluation." 2) Defining criteria 
for "black box evaluation." A Proposal for establishing task 
oriented benchmarks for NLP Systems (Session Chair  -   Beth 
Sundheim)  3)  Defining criteria for "glass box evaluation." 
(Session Chair - Jerry Hobbs)  Several  different  types  of 
systems will be discussed, including question-answering sys- 
tems, text processing systems and generation systems. 

[Note: too late for sending in papers - sorry for not 
getting this out sooner - I have omitted that section since 
it is no longer relevant - Ed.] 

                       Martha Palmer 
                           Unisys 
                   Research & Development 
                         PO Box 517 
                      Paoli, PA 19301 
                   palmer@prc.unisys.com 
                       (215) 648-7228 

------------------------------ 

Date: Tue, 27 Sep 88 23:41 EDT 
From: EHRICH@vtcs1.cs.vt.edu 
Subject: NSF staffing 

Recently someone asked me where to send something at NSF.  Here is the  
staffing of the CCR (Computer and Computation Research) and IRIS  
(Information, Robotics, and Intelligent Systems) divisions: 

CCR: 

Director: Peter Freeman 
Deputy Director: Helen Gigley 

Theory: Errol Lloyd 
Computer Architecture: Zeke Zalcstein 
Software Engineering: John Gannon 
Numeric and Symbolic Computation: Kamal Abdali 
Software Systems: Thomas Keenan (returning after October 15) 

 
IRIS: 

Director: Y.T. Chien 
Deputy Director: Bruce Barnes (surprise, folks?) 

Robotics and Machine Intelligence: Ken Laws 
Knowledge and Cognitive Systems: Henry Hamburger 
Database and Expert Systems: Maria Zemankova 
Interactive Systems: Hal Bamford 
Information Technology and Organizations: Laurence Rosenberg 

------------------------------ 

Date:     Mon, 10 Oct 88 17:00 EDT 
From: krovetz@UMass 
Subject:  times collection 
      
Ed, 
  Just wanted to let you know that I finally did get the old Times 
collection from Cornell (thanks to Chris Buckley).  I have four files: 
the documents themselves, the queries, a stopword list, and a set of 
relevance judgements.  Chris said he tried SMART on it and got an 
average performance of 64% precision (averaged over the 25%, 50% and 
75% recall levels).  This is pretty high, but the collection is also 
rather small (425 documents and 83 queries). 
      
-bob 
[Note: I am putting that collection and a few others onto 
Virginia Disc One - thanks to Bob for his perseverence in 
sending the many files over BITNET, till we have them all - Ed.] 

------------------------------ 

Date: Wed, 12 Oct 88 17:13:00 PDT 
From: Emma Pease <emma@CSLI.STANFORD.EDU> 
Subject: CSLI Calendar, October 13, 4:4 [Extract - Ed.] 

 ... 
			    NEW PUBLICATIONS 

   The following reports have recently been published.  They may be 
   obtained, or a full list acquired by writing to Trudy Vizmanos,  
   CSLI, Ventura Hall, Stanford, CA 94305-4115, or 
   publications@csli.stanford.edu. 

   112. Bare Plurals, Naked Relatives, and Their Kin. 
        Dietmar Zaefferer $2.50  

   113. Events and ``Logical Form''.        Stephen Neale $2.00  

   114. Backwards Anaphora and Discourse Structure: Some 
        Considerations.        Peter Sells $2.50  

   115. Toward a Linking Theory of Relation Changing Rules in LFG. 
        Lori Levin $4.00  

   116. Fuzzy Logic.        L. A. Zadeh $2.50  

   117. Dispositional Logic and Commonsense Reasoning. 
        L. A. Zadeh $2.00  

   118. Intention and Personal Policies.      Michael Bratman $2.00  

   119. Propositional Attitudes and Russellian Propositions. 
        Robert C.Moore $2.50  

   120. Unification and Agreement.        Michael Barlow $2.50  

   121. Extended Categorial Grammar.     Suson Yoo and Kiyong Lee $4.00 

   122. The Situation in Logic---IV: On the Model Theory of Common 
        Knowledge.     Jon Barwise $2.00 

   123. Unaccusative Verbs in Dutch and the Syntax-Semantics Interface. 
        Annie Zaenen $3.00  

   124. What Is Unification? A Categorical View of Substitution, 
        Equation and Solution.     Joseph A. Goguen $3.50  

   125. Types and Tokens in Linguistics.    Sylvain Bromberger $3.00  

   126. Determination, Uniformity, and Relevance: Normative Criteria  
        for Generalization and Reasoning by Analogy. 
        Todd Davies $4.50  

   127. Modal Subordination and Pronominal Anaphora in Discourse. 
        Craige Roberts $4.50  

   128. The Prince and the Phone Booth: Reporting Puzzling Beliefs. 
        Mark Crimmins and John Perry $3.50  

   129. Set Values for Unification-Based Grammar Formalisms and Logic 
        Programming.    William Rounds $4.00  

   130. Fifth Year Report of the Situated Language Research Program. 
        free  

   131. Locative Inversion in Chichewa: A Case Study of Factorization 
        in Grammar.     Joan Bresnan and Jonni M. Kanerva $5.00  

   132. An Information-Based Theory of Agreement.  
        Carl Pollard and Ivan A.Sag $4.00 

------------------------------ 
      
END OF IRList Digest 
********************
IRList Digest Volume 4 Number 56

Share this article

Let's discover also

IRList Digest Volume 3 Number 40

IRList Digest Volume 2 Number 60

IRList Digest Volume 4 Number 57

IRList Digest Volume 1 Number 12

IRList Digest Volume 5 Number 01

IRList Digest Volume 2 Number 31

IRList Digest Volume 3 Number 45

IRList Digest Volume 1 Number 15

IRList Digest Volume 1 Number 13

IRList Digest Volume 4 Number 36

Recent Articles

The Electrical Knowledge of Ancient Civilizations

Mind Control Cult Newsletter -- Issue #5

Mind Control Cult Newsletter -- Issue #4

Mind Control Cult Newsletter -- Issue #3

Mind Control Cult Newsletter -- Issue #2

Mind Control Cult Newsletter -- Issue #1

Physics (Book 8) by Aristotle

Physics (Book 7) by Aristotle

Physics (Book 6) by Aristotle

Physics (Book 5) by Aristotle

Recent Comments