1
What Are Databases
Made Of?
  • Databases or files
    • Subfiles
      • Records
        • Fields
          • Subfields
2
A Sample Database Record
  • Personal author: Garreau, Joel.
  • Title: The nine nations of North America / Joel Garreau.
  • Publication info: Boston : Houghton Mifflin, 1981.
  • ISBN: 0395291240 : $12.95
  • Physical description: xvii, 427 p. : ill. ; 24 cm.
  • General note: Includes index.
  • Bibliography note: Bibliography: p. [397]-413.
3
A Sample Database Record (cont.)
  • Personal subject: Garreau, Joel.
  • Subject: United States--Description and travel--1960-
  • Subject: United States--Economic conditions--1961-
  • Subject: United States--Social conditions--1960-
  • Subject: Canada--Description and travel--1945-
  • Subject: Canada--Economic conditions--1945-
  • Subject: Canada--Social condition.


4
The Record’s Underlying Structure
  • 100:  10 : Garreau, Joel.
  • 245:  14 : The nine nations of North America /|cJoel Garreau.
  • 260:  0  : Boston :|bHoughton Mifflin,|c1981.
  • 020:     : 0395291240 :|c$12.95
  • 300:     : xvii, 427 p. :|bill. ;|c24 cm.
  • 500:     : Includes index.
  • 504:     : Bibliography: p. [397]-413.
  • 596:     : 5
5
The Record’s Underlying Structure (cont.)
  • 600:  10 : Garreau, Joel.
  • 651:   0 : United States|xDescription and travel|y1960-
  • 651:   0 : United States|xEconomic conditions|y1961-
  • 651:   0 : United States|xSocial conditions|y1960-
  • 651:   0 : Canada|xDescription and travel|y1945-
  • 651:   0 : Canada|xEconomic conditions|y1945-
  • 651:   0 : Canada|xSocial condition.
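The tagged lines above can be split mechanically into tag, indicators, and subfields. The following is a minimal sketch, assuming the display convention used on these slides: a colon-separated "tag : indicators : data" layout, a "|" followed by a one-letter code for each subfield, and an unmarked leading chunk that is treated as subfield "a". The function name parse_field is hypothetical.

```python
def parse_field(line):
    """Split 'TAG: IND : DATA' into (tag, indicators, [(code, value), ...])."""
    # Split on the first two colons only; the data itself may contain colons.
    tag, indicators, data = line.split(":", 2)
    pieces = data.strip().split("|")
    # Assumption: the first, unmarked piece is subfield "a".
    subfields = [("a", pieces[0].strip())]
    # Every later piece starts with its one-letter subfield code.
    subfields += [(piece[0], piece[1:].strip()) for piece in pieces[1:] if piece]
    return tag.strip(), indicators.strip(), subfields


print(parse_field("651:   0 : United States|xDescription and travel|y1960-"))
print(parse_field("260:  0  : Boston :|bHoughton Mifflin,|c1981."))
```

Each subfield (place, topic, date) comes back as its own value, which is what lets a system index or display the pieces separately.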



6
Matt’s Presidential Literature Database
  • RN - 101
  • AU - Mazlish, Bruce
  • TI - In Search of Nixon : A Psychohistorical Inquiry
  • SU - Nixon, Richard M. (Richard Milhouse), 1913-1994


7
Matt’s Presidential Literature Database (cont.)
  • RN - 102
  • AU - Eisenhower, Dwight D.
  • TI - At Ease : Stories I Tell My Friends
  • SU - Eisenhower, Dwight D. (Dwight David), 1890-1969
8
Matt’s Presidential Literature Database (cont.)
  • RN - 103
  • AU - Ellis, Joseph J.
  • TI - Passionate Sage : The Character and Legacy of John Adams
  • SU - Adams, John, 1743-1826


9
Basic vs Additional Index
  • For our toy database, the titles and subjects make up the basic index
  • Our additional index will be the author index
10
Dialog’s Stop Words
  • an
  • and
  • by
  • for
  • from
  • of
  • the
  • to
  • with


11
Number all Words and/or Phrases
  • Mazlish, Bruce 101 AU1
  • in 101 TI1
  • search 101 TI2
  • nixon 101 TI4
  • a 101 TI5
  • psychohistorical 101 TI6
  • inquiry 101 TI7
12
Number all Words and/or Phrases (cont.)
  • nixon 101 SU1
  • richard 101 SU2
  • m 101 SU3
  • richard 101 SU4
  • milhouse 101 SU5
  • 1913 101 SU6


13
Number all Words and/or Phrases (cont.)
  • 1994 101 SU7
  • Nixon, Richard M. (Richard Milhouse), 1913-1994 101 SU8
  • Correction:  101 SU1, SU2, …, SU7
  • Eisenhower, Dwight D. 102 AU1
  • at 102 TI1
  • ease 102 TI2
14
Number all Words and/or Phrases (cont.)
  • stories 102 TI3
  • i 102 TI4
  • tell 102 TI5
  • my 102 TI6
  • friends 102 TI7
  • eisenhower 102 SU1
  • dwight 102 SU2
15
Number all Words and/or Phrases (cont.)
  • … And so on …


  • Next step → Alphabetize the list for the basic index
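The numbering step can be sketched in a few lines. This is an illustration only, assuming that words are runs of letters or digits, that everything is lowercased, and that stop words are left out of the index but still use up a position number (which is why "nixon" is TI4 in record 101). The function name word_postings is hypothetical.

```python
import re

# Dialog's stop words, as listed on the earlier slide.
STOP_WORDS = {"an", "and", "by", "for", "from", "of", "the", "to", "with"}


def word_postings(record_number, field, text):
    """Return (word, record number, field + position) for each indexed word."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    return [
        (word, record_number, f"{field}{position}")
        for position, word in enumerate(words, start=1)
        if word not in STOP_WORDS  # skipped, but the position is still counted
    ]


for posting in word_postings(101, "TI", "In Search of Nixon : A Psychohistorical Inquiry"):
    print(posting)
```

Running the same function on the SU field of record 101 yields SU1 through SU7, matching the word entries on the slides.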
16
Alphabetize the List

  • 1743 103 SU3
  • 1826 103 SU4
  • 1890 102 SU6
  • 1913 101 SU6
  • 1969 102 SU7
  • 1994 101 SU7
17
Alphabetize the List (cont.)
  • a 101 TI5
  • adams 103 TI9
  • adams 103 SU1
  • Adams, John, 1743-1826 103 SU5
  • Correction: 103 SU1, SU2, SU3, SU4
  • at 102 TI1
  • character 103 TI4
  • d 102 SU3
18
Alphabetize the List (cont.)
  • david 102 SU5
  • dwight 102 SU2
  • dwight 102 SU4
  • ease 102 TI2
  • eisenhower 102 SU1
  • Eisenhower, Dwight D. (Dwight David), 1890-1969 102 SU8
  • Correction: 102 SU1, SU2, …, SU7
  • … and so on
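Alphabetizing is an ordinary sort on the index term. A minimal sketch, using a handful of word entries copied from the earlier slides and assuming a plain character-by-character comparison, in which digits sort ahead of letters:

```python
# A few (term, record number, field position) entries in arbitrary order.
postings = [
    ("ease", 102, "TI2"),
    ("1913", 101, "SU6"),
    ("at", 102, "TI1"),
    ("1743", 103, "SU3"),
    ("a", 101, "TI5"),
    ("1994", 101, "SU7"),
    ("1890", 102, "SU6"),
]

# Sort by term, then by record number; digits come before letters.
basic_index = sorted(postings, key=lambda entry: (entry[0].lower(), entry[1]))

for term, record_number, position in basic_index:
    print(term, record_number, position)
```

The dates come out first and the words follow, the same order as on the slides.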



19
Author Additional Index
  • Eisenhower, Dwight D. 102 AU1
  • Ellis, Joseph J. 103 AU1
  • Mazlish, Bruce 101 AU1


  • Note that this index is phrase indexed but NOT word indexed
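A small sketch of what "phrase indexed but not word indexed" means in practice: each author name is stored whole, so only an exact match on the full phrase finds the record. The names come from the AU fields of the toy records; the helper select_author is hypothetical.

```python
# AU fields of the three toy records.
authors = {
    101: "Mazlish, Bruce",
    102: "Eisenhower, Dwight D.",
    103: "Ellis, Joseph J.",
}

# One entry per whole phrase, alphabetized; no per-word entries are made.
author_index = sorted((name, record_number, "AU1") for record_number, name in authors.items())


def select_author(phrase):
    """Return the record numbers whose author phrase matches exactly."""
    return [record_number for name, record_number, _ in author_index if name == phrase]


print(select_author("Mazlish, Bruce"))  # the whole phrase matches
print(select_author("Mazlish"))         # a single word finds nothing
```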
20
A Record from Dialog
  • Let’s look beyond the book record of a catalog:
  • http://library.dialog.com/bluesheets/html/bl0001.html
21
Does the Internet have Structure?
  • http://www.contrib.andrew.cmu.edu/~matthewm/mrmwork.html
22
Starting a Search in ERIC on Dialog
  • ?BEGIN 1
  • or, in an abbreviated format:
  • ?b 1
23
"?B"
  • ?B 1
  •        30may03 22:38:44 User556323 Session D1.1
  •             $0.00    0.241 DialUnits FileHomeBase
  •      $0.00  Estimated cost FileHomeBase
  •      $0.04  INTERNET
  •      $0.04  Estimated cost this search
  •      $0.04  Estimated total session cost   0.241 DialUnits
  •  File   1:ERIC  1966-2003/May 10
  •        (c) format only 2003 The Dialog Corporation
  •       Set  Items  Description
  •       ---  -----  -----------
  • ?
24
Searching for a Word
  • ?SELECT mathematics


  • or, in abbreviated format:


  • ?s mathematics


25
 
26
Using Boolean (Logical) Operators
  • OR operator: Use OR to group synonymous terms when at least one must be present.
  • AND operator: Use AND to connect terms when both or all must be present.
  • NOT operator: Use NOT to exclude records containing a specified term.
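The three operators behave like set operations on record numbers. A minimal sketch with made-up record numbers (the sets below are hypothetical, not ERIC data):

```python
# Hypothetical sets of record numbers retrieved for two terms.
mathematics = {1, 2, 3, 4}
fear = {3, 4, 5}

either = mathematics | fear     # OR: at least one term is present
both = mathematics & fear       # AND: both terms are present
math_only = mathematics - fear  # NOT: mathematics, excluding records with fear

print(sorted(either), sorted(both), sorted(math_only))
```

OR can only grow a result, while AND and NOT can only shrink it.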


27
Venn Diagram Depicting AND Logic
28
Venn Diagram Depicting OR Logic
29
 
30
Using OR Operator; Display Sets (DS) Command
  • ?S MATH OR MATHEMATICS
  •            10825  MATH
  •            54890  MATHEMATICS
  •       S5   58984  MATH OR MATHEMATICS
  • ?DS
  • Set     Items   Description
  • S1      54890   MATHEMATICS
  • S2       4055   FEAR
  • S3        116   S1 AND S2
  • S4     169200   1 AND 2
  • S5      58984   MATH OR MATHEMATICS
  • ?
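The item counts for MATH, MATHEMATICS, and MATH OR MATHEMATICS also tell us how many records contain both words, because records counted twice in the sum are counted once in the OR set. A quick check of that arithmetic, using only the numbers shown above:

```python
# Item counts reported by Dialog for set S5.
math_items = 10825
mathematics_items = 54890
either_items = 58984  # MATH OR MATHEMATICS

# Records containing both words are counted twice in the sum of the two terms.
both_items = math_items + mathematics_items - either_items
print(both_items)  # 6731
```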
31
"?S"
  • ?S S2 AND S5
  •             4055  S2
  •            58984  S5
  •       S6     138  S2 AND S5
  • ?DS
  • Set     Items   Description
  • S1      54890   MATHEMATICS
  • S2       4055   FEAR
  • S3        116   S1 AND S2
  • S4     169200   1 AND 2
  • S5      58984   MATH OR MATHEMATICS
  • S6        138   S2 AND S5
  • ?
32
“/ENG” – Limit by Language
  • ? S S3 OR S6
  •              116  S3
  •              138  S6
  •       S7     138  S3 OR S6
  • ?S S7/ENG
  • >>>Term "ENG" is not defined in file 1 and is ignored
  •       S8     138  S7/ENG
  • ?
  • http://library.dialog.com/bluesheets/html/bl0001.html


33
Suffix Searching
  • ?s mathematics searches for the word ‘mathematics’ in the basic index (which usually includes the abstract field and may include full text)


  • ?s mathematics/ti searches for the word ‘mathematics’ in the title field


  • ?s mathematics/ti,de searches for the word ‘mathematics’ in the title OR descriptor fields
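The effect of a suffix can be sketched as restricting which fields are examined. Everything in this example is hypothetical: the three records, the choice of ti, de, and ab as the basic-index fields, and the select function. It only illustrates the idea that a suffix such as ti,de means "title OR descriptor".

```python
import re

# Three made-up records with title (ti), descriptor (de), and abstract (ab) fields.
records = {
    1: {"ti": "Mathematics anxiety in fifth grade", "de": "Fear; Anxiety", "ab": "A study of pupils."},
    2: {"ti": "Fear of failure", "de": "Mathematics Education", "ab": "Survey results."},
    3: {"ti": "Reading habits", "de": "Literacy", "ab": "Mathematics is mentioned only here."},
}
BASIC_INDEX_FIELDS = ("ti", "de", "ab")  # assumption for this toy example


def select(term, suffix=None):
    """Return record numbers where the word occurs in any of the chosen fields."""
    fields = suffix.split(",") if suffix else BASIC_INDEX_FIELDS
    word = term.lower()
    return sorted(
        number
        for number, record in records.items()
        if any(word in re.findall(r"[a-z0-9]+", record.get(field, "").lower()) for field in fields)
    )


print(select("mathematics"))           # basic index
print(select("mathematics", "ti"))     # title only
print(select("mathematics", "ti,de"))  # title OR descriptor
```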
34
EXPAND Command in the Language Index (LA=)
  • ? E LA=ENGLISH
  • Ref   Items  Index-term
  • E1       19  LA=DUTCH
  • E2        1  LA=EDO
  • E3   752078 *LA=ENGLISH
  • E4        1  LA=ESPERANTO
  • .
  • .
  • .
  • E8       14  LA=FINNISH
  • E9     3292  LA=FRENCH
  • E10       3  LA=FULANI
  • E11       1  LA=GANDA
  • E12     728  LA=GERMAN
  •           Enter P or PAGE for more
  • ?
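EXPAND is essentially a look-up in the alphabetized index that shows the neighbors of a term. The sketch below reproduces only the first four rows of the display above; the rows elided on the slide are left out. Placing the requested term at E3 (two entries before it) is an assumption read off the slide, and the function name expand is hypothetical.

```python
import bisect

# Index terms and item counts copied from the first four rows of the display.
LANGUAGE_INDEX = [
    ("LA=DUTCH", 19),
    ("LA=EDO", 1),
    ("LA=ENGLISH", 752078),
    ("LA=ESPERANTO", 1),
]


def expand(term, index, before=2, size=4):
    """Return (ref, items, index term) rows around the term's alphabetical position."""
    terms = [entry for entry, _ in index]
    position = bisect.bisect_left(terms, term)  # where the term sits, or would sit
    start = max(0, position - before)
    window = index[start:start + size]
    return [
        (f"E{n}", items, ("*" if entry == term else "") + entry)
        for n, (entry, items) in enumerate(window, start=1)
    ]


for row in expand("LA=ENGLISH", LANGUAGE_INDEX):
    print(row)
```

Because the look-up is by alphabetical position, it also works for a term that is not in the index: the display simply shows the terms that would surround it.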
35
Select Command From an Expanded List
  • ? S E3
  •       S9  752078  LA='ENGLISH'
  • ?DS
  • Set     Items   Description
  • S1      54890   MATHEMATICS
  • S2       4055   FEAR
  • S3        116   S1 AND S2
  • S4     169200   1 AND 2
  • S5      58984   MATH OR MATHEMATICS
  • S6        138   S2 AND S5
  • S7        138   S3 OR S6
  • S8        138   S7/ENG
  • S9     752078   LA='ENGLISH'
  • ?
36
Combining Our Results with the Set of All English Language Records; What Have We Spent So Far?!!!
  • ? S S7 AND S9
  •              138  S7
  •           752078  S9
  •      S10     129  S7 AND S9
  • ?COST
  •        30may03 22:42:18 User556323 Session D1.2
  •             $1.22    0.817 DialUnits File1
  •      $1.22  Estimated cost File1
  •      $0.20  INTERNET
  •      $1.42  Estimated cost this search
  •      $1.46  Estimated total session cost   1.057 DialUnits
  • ?
37
The TYPE Command
  • ? T S10/8/1-5
  •  10/8/1
  • DIALOG(R)File   1:(c) format only 2003 The Dialog Corporation. All rts. reserv.
  • 01106676 ERIC NO.: ED458214 CLEARINGHOUSE NO.: TM032844
  • The Debate over National Testing. ERIC Digest.
  •   April 2001 (20010400)
  • DESCRIPTORS: Academic Achievement; *Achievement Tests; Elementary Secondary Education; *Federal Government; *Government Role; *National Competency Tests; Performance Based Assessment; *Politics; Test Construction; *Test Use
  • IDENTIFIERS: ERIC Digests; *National Assessment of Educational Progress
38
"10/8/2"
  •  10/8/2
  • DIALOG(R)File   1:(c) format only 2003 The Dialog Corporation. All rts. reserv.
  • 01098402 ERIC NO.: ED452367 CLEARINGHOUSE NO.: CE081628
  • Women and Minorities in High-Tech Careers. ERIC Digest No. 226.
  •   2001 (20010000)
  • DESCRIPTORS: Attitude Change; *Career Education; Change Strategies;
  •   Community Colleges; Computer Attitudes; *Education Work Relationship; Educational Change; Educational Environment; Educational Policy; Educational Technology; Elementary Secondary Education; Employed Women; Employment Patterns; Equal Education; Information Needs; Job Training; Leadership; Literature Reviews; *Minority Groups; Needs Assessment; *Nontraditional Occupations; Recruitment; Role Models; Sex Fairness; *Technical Occupations; Technological Advancement; Trend Analysis; Two   Year Colleges; Vocational Education; *Womens Education
  • IDENTIFIERS: ERIC Digests
  •  10/8/3
  • .
  • .
  • .
39
Details of the Type Command

  • ?T S10/8/1-5


  • “S10” is the set number that you are choosing


  • “8” is the format that you’re choosing for your output


  • “1-5” are the first five records of your resultant set S10


  • http://library.dialog.com/bluesheets/html/bl0001.html
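The three parts of the command can be pulled apart mechanically. This sketch handles only the "set/format/range" form shown on this slide; other forms of the TYPE command are not modeled, and the function name parse_type is hypothetical.

```python
import re


def parse_type(command):
    """Split a command such as '?T S10/8/1-5' into set, format, and record numbers."""
    match = re.fullmatch(
        r"\??\s*T(?:YPE)?\s+S(\d+)/(\w+)/(\d+)(?:-(\d+))?",
        command.strip(),
        re.IGNORECASE,
    )
    if match is None:
        raise ValueError(f"not a recognized TYPE command: {command!r}")
    set_number, output_format, first, last = match.groups()
    first = int(first)
    last = int(last) if last else first  # a single record has no range end
    return {
        "set": int(set_number),
        "format": output_format,
        "records": list(range(first, last + 1)),
    }


print(parse_type("?T S10/8/1-5"))
print(parse_type("? T S12/9/1"))
```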


40
Expand of the Basic Index in ERIC
  • ? E GRADE 5


  • Ref   Items   RT  Index-term
  • E1     3486    3   GRADE 3
  • E2     4327    3   GRADE 4
  • E3     4463    3 *GRADE 5
  • .
  • .
  • .
  • E7     2561    5   GRADE 9
  • E8        1        GRADEAID
  • E9       44        GRADEBOOK
  • E10       1        GRADEBOOK PROGRAMS
  • E11      15        GRADEBOOKS
  • E12       1        GRADECALC
  •           Enter P or PAGE for more
  • ?
41
"? S E3"
  • ? S E3
  •      S11    4463  'GRADE 5'
  • ?S S10 AND S11
  •              129  S10
  •             4463  S11
  •      S12       1  S10 AND S11
  • ?
42
"? T S12/9/1"
  • ? T S12/9/1
  •  12/9/1
  • DIALOG(R)File   1:ERIC
  • (c) format only 2002 The Dialog Corporation. All rts. reserv.
  • 00490065 ERIC NO.: ED219235 CLEARINGHOUSE NO.: SE038286
  • Teaching Problem Solving; the Effect of Algorithmic and Heuristic Problem
  • Solving Training in Relation to Task Complexity and Relevant Aptitudes.
  •   de Leeuw, L.;
  • CORP. SOURCE: Free Univ., Amsterdam (Netherlands). (BBB20582)
  •   16pp.
  •   1982 (19820000)
  • EDRS Price MF01/PC01 Plus Postage.
  • LANGUAGE: English
  • DOCUMENT TYPE: 143 (Reports--Research)
  • RECORD TYPE: ABSTRACT
  • COUNTRY OF PUBLICATION: Netherlands
  • JOURNAL ANNOUNCEMENT: RIEDEC1982
  •    Sixty-four fifth and sixth-grade pupils were taught number series
  • extrapolation by either an algorithm, fully prescribed …
43
LOGOFF Command and Session Costs
  • ?LOGOFF
  •        30may03 22:47:24 User556323 Session D1.2
  •             $1.83    1.222 DialUnits File1
  •                $0.00  5 Type(s) in Format  8
  •                $0.00  2 Type(s) in Format  9
  •             $0.00  7 Types
  •      $1.83  Estimated cost File1
  •      $0.45  INTERNET
  •      $2.28  Estimated cost this search
  •      $2.32  Estimated total session cost   1.462 DialUnits
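The totals in the LOGOFF display can be checked by hand. A short sketch of the arithmetic, using the figures above plus the $0.04 session total reported when File 1 was first opened:

```python
from decimal import Decimal

file_cost = Decimal("1.83")      # Estimated cost File1
internet_cost = Decimal("0.45")  # INTERNET
earlier_cost = Decimal("0.04")   # session total before File 1 was opened

search_cost = file_cost + internet_cost    # Estimated cost this search
session_cost = search_cost + earlier_cost  # Estimated total session cost

print(search_cost, session_cost)  # 2.28 2.32
```

Decimal is used instead of float so that the cents add up exactly.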




44
Nesting
  • Consider the following commands:
  • ?s cat or feline
  • ?s leukemia
  • ?s s1 and s2
  • ?s cat or feline and leukemia
  • ?s (cat or feline) and leukemia


45
Why Nesting is so Important
  • You can enter any combination of logical operators in a single SELECT.


  • Use parentheses as needed to specify the correct order of processing.


  • Without parentheses, the order of processing (for Dialog) is:
    • First: NOT operators
    • Second: AND operators
    • Third: OR operators
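The difference parentheses make can be seen with small sets. The record numbers below are hypothetical. Python's set operators happen to follow the same order as the list above (difference, then intersection, then union), so the unparenthesized expression behaves like the unparenthesized Dialog search.

```python
# Hypothetical record numbers for each term.
cat = {1, 2}
feline = {2, 3}
leukemia = {3, 4}

# ?s cat or feline and leukemia  ->  AND is processed before OR
unnested = cat | feline & leukemia

# ?s (cat or feline) and leukemia  ->  parentheses force OR first
nested = (cat | feline) & leukemia

print(sorted(unnested))  # [1, 2, 3]
print(sorted(nested))    # [3]
```

Without parentheses, every record about cats is retrieved whether or not it mentions leukemia, which is rarely what the searcher wants.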

46
An Example of What Nesting Does
47
Truncation and Wildcards
  •  model
  •  models
  •  modeled
  •  modeler
  •  modeling
  •  modelled
  •  modelling


48
Truncation Varieties
  • Multiple character
    • ?s model?
  • Finite number of characters
    • ?s model????
  • But … what about one character?
    • ?s model? ?


49
Wildcards
  • ?s wom?n
    • Searches for woman, women, or womyn
  • What about searching for labor or labour?
    • Can’t be done with a wildcard in Dialog: each ? must stand in for exactly one character, so the pattern needs the same number of characters as the word
  • Use wildcards only where they are needed
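One way to get a feel for these symbols is to translate them into regular expressions. The sketch below models only three cases as described on these slides: a single trailing ? (any number of characters), a trailing "? ?" (at most one extra character), and an internal ? (exactly one character). The repeated form such as model???? is deliberately not modeled, and the function names are hypothetical.

```python
import re


def dialog_pattern(term):
    """Translate a Dialog-style term into a regex (assumed semantics, see above)."""
    if term.endswith("? ?"):
        stem, tail = term[:-3], ".?"  # at most one additional character
    elif term.endswith("?"):
        stem, tail = term[:-1], ".*"  # open truncation: any number of characters
        if stem.endswith("?"):
            raise ValueError("repeated ? truncation is not modeled in this sketch")
    else:
        stem, tail = term, ""
    # Any ? left inside the stem stands for exactly one character.
    body = "".join("." if ch == "?" else re.escape(ch) for ch in stem)
    return re.compile(body + tail, re.IGNORECASE)


def matches(term, words):
    """Return the words that the term would retrieve."""
    pattern = dialog_pattern(term)
    return [word for word in words if pattern.fullmatch(word)]


MODEL_WORDS = ["model", "models", "modeled", "modeler", "modeling", "modelled", "modelling"]
print(matches("model?", MODEL_WORDS))
print(matches("model? ?", MODEL_WORDS))
print(matches("wom?n", ["woman", "women", "womyn", "womn"]))
print(matches("labo?r", ["labor", "labour"]))
```

The last line shows the labor/labour problem: the pattern finds labour but not labor, because the wildcard cannot stand for zero characters.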