The Turfgrass Information File (TGIF):
Database Characteristics and Content Evaluation

Carrie Preston

August 2000


The turfgrass sector comprises a large number of businesses, organizations, educational programs and research programs related to the use of grass in lawns, landscapes, golf courses, athletic fields, parks, and other non-agricultural contexts. Turf-related education programs currently exist at over 100 colleges and universities in North America, with over 30 related graduate education/research programs (Golf Course Superintendents Association of America, 1997). However, turfgrass researchers and professionals may have difficulty finding bibliographic access to published information in their field. Turfgrass literature, being unrelated to food production, may be outside the scope of bibliographic databases focused on agriculture or may be processed on a low-priority basis. Turfgrass research and practice is also heavily dependent on a literature of professional, trade, and university publications, many distributed on a local or regional basis, which are not peer-reviewed and so do not appear in scientific journal databases, while being too specialized to appear in general-interest nonscientific databases.

The Turfgrass Information File (TGIF) is a bibliographic database produced by the Turfgrass Information Center at Michigan State University. Created in 1984, TGIF is solely dedicated to indexing turfgrass-related literature, turfgrass being broadly defined as any grass cultivated in a non-agricultural context (Cookingham, 1999; Turfgrass Information Center, 2000a, 2000b). TGIF is intended to provide turfgrass researchers and professionals with a mode of access to bibliographic information in their field, by providing a large proportion of unique content, bibliographic information on turf-focused publications not indexed by other online databases. TGIF is also assumed to provide for more efficient searching of turf-related topics by reducing the need to eliminate turf-irrelevant content from search result sets.

Bibliographic databases can be evaluated using a number of criteria related to software, hardware, and content or "dataware" (Jacso, 1992). Even when evaluation is limited to database content, a number of factors may be taken into account, including general database scope, composition, source coverage, time period coverage; the range of serials indexed by the database; and the consistency and completeness of coverage of individual serials (Jacso, 1997, 1998). This report focuses on the content of TGIF, both in itself and in comparison to the general agricultural databases that are the main alternative for users seeking access to current, turfgrass-related bibliographic information. The general characteristics of TGIF will be summarized and its content evaluated on a variety of dimensions. TGIF will be also compared to three major agricultural databases: AGRICOLA, produced by the U.S. National Agricultural Library; AGRIS, produced by the Food and Agriculture Organization of the United Nations; and CAB ABSTRACTS, produced by CAB International in the United Kingdom (Thomas, 1990). TGIF will be compared to the agricultural databases in terms of utility for turf-related subject searching. The proportion of unique versus overlapping content between TGIF and each of the major agricultural databases will be examined.

General Description of TGIF

As of June 2000, TGIF contained over 65,000 bibliographic records, of which about 60,000 represent articles from print serials. The remaining records represent a range of other materials including books and book chapters, theses and dissertations, booklets and brochures, university agricultural extension service fact sheets, plant cultivar registrations, audiovisual materials, and electronic materials. Electronic materials indexed in TGIF include over 3,000 World Wide Web documents, of which 90% may be accessed by direct links from their TGIF records when the records are viewed in the Turfgrass Information Center's Web interface. In addition to documents available in full text on the Web, full text articles from certain print serials are currently being added to TGIF; as of June 2000 these totaled about 500 records. Material formats indexed by TGIF are summarized in Table 1.


Table 1. Material formats indexed by TGIF
Material format Total number of TGIF records (through record #65000) Percent of total TGIF records (through record #65000) Number of TGIF records for materials published in 1990 or later (through record #65000) Percent of TGIF records for materials published in 1990 or later (through record #65000)
Articles from print serials 59,853 92% 36,528 91%
Books and book chapters 1,815 3% 1,169 3%
Theses and dissertations 720 1% 257 1%
Booklets, brochures and fact sheets 643 1% 327 1%
World Wide Web documents 2,013 3% 1,838 5%
Electronic materials, other than World Wide Web documents 22 <1% 13 <1%
Audiovisual materials 52 <1% 8 <1%
Other (patents, manuscripts, cultivar registrations, etc.) 260 <1% 124 <1%
Total* 65,000 100% 39,971 100%
*Sums of individual document types are greater than total because some documents exist in both World Wide Web and print formats.

Serials indexed by TGIF are published by a variety of organizations for scientific, professional, and popular audiences. Serials indexed by TGIF include peer-reviewed journals; scientific research report sources such as Agronomy Abstracts and university turfgrass department research reports; national and international-level professional publications such as USGA Green Section Record, Golf Course Management, and International Turfgrass Bulletin; local professional newsletters; proceedings of local, national and international turf-related conferences; and popular magazines such as Golf Digest. TGIF currently includes records for articles drawn from over 1800 serials. As of August 2000, serials regularly scanned by TGIF included 129 scientific publications, 159 professional and trade publications, and 25 other publications. Over 50% of TGIF records represent content from professional and trade magazines, newsletters and conference proceedings; these publications are widely utilized within the turfgrass profession and most are not found in other indexes and databases. Major types of publications indexed by TGIF are summarized in Tables 2 and 3.


Table 2. Print article sources indexed by TGIF
Article source Total number of TGIF records (through record #65000)* Percent of total TGIF records (through record #65000) Number of TGIF records for materials published in 1990 or later (through record #65000) Percent of TGIF records for materials published in 1990 or later (through record #65000)
All scientific publications 17,767 30% 8,980 25%
  • peer-reviewed journals
  • 6,432
  • 11%
  • 2,638
  • 7%
  • non-peer reviewed research report venues
  • 11,478
  • 19%
  • 6,477
  • 18%
Professional and trade magazines and newsletters 33,166 55% 25,435 70%
Conference proceedings (where not subsumed into one of the above categories)** 4,099 7% 1,848 5%
Popular magazines and newspapers 828 1% 770 2%
Other/unknown 5,893 10% 737 2%
Total print articles 59,853 100% 36,528 100%
*sum total is greater than 59,853 because some serials may have multiple article types
**recent policy changes have led conference proceedings articles to be classed in TGIF as scientific reports or professional/trade publications as appropriate

Table 3. Serial publications regularly scanned by TGIF
Type of serial Number of serials scanned by TGIF* Percent
All scientific publications 129 41%
  • peer-reviewed journals
  • 53
  • 17%
  • non-peer reviewed research report venues
  • 77
  • 25%
Professional and trade magazines and newsletters 159 51%
Conference proceedings (where not subsumed into one of the above categories)** 15 5%
Popular magazines and newspapers 10 3%
Total 313 100%
*as of August 2000
**recent policy changes have led conference proceedings to be classed in TGIF as scientific reports or professional/trade publications as appropriate

Serials are selectively indexed according to subject relevance; most or all articles from turfgrass-focused professional and trade publications are indexed, while only turf-related articles are selected from broader agricultural journals. Book reviews and extremely brief "news item" type articles are typically not indexed. Efforts are made to restrict items published in more than one serial to a single TGIF record containing information on all known appearances of the item.

TGIF is primarily focused on indexing current materials, with ongoing secondary efforts to index older materials. Over 90% of the materials indexed by TGIF were published in or after 1970, and 60% were published in or after 1990. However, the database does include about 4,000 records for materials published before 1970, including all United States Golf Association Green Section serials published since 1921. Time period coverage of TGIF is summarized in Table 4.


Table 4. Time period coverage of TGIF
Date of publication Number of TGIF records (through record #65000) Percent of TGIF records (through record #65000)
Before 1920 83 <1%
1920-1929 959 ~2%
1930-1939 520 ~1%
1940-1949 505 ~1%
1950-1959 568 ~1%
1960-1969 1,273 2%
1970-1979 6,369 10%
1980-1989 14,561 23%
1990-1999 39,011 62%
Total through 1999 63,849 100%

Currently, TGIF is composed primarily of English-language materials, with a limited number of non-English materials represented, particularly articles from the German professional journal Rasen. Over 70% of the publications regularly indexed by TGIF have a North American geographic focus. However, TGIF does index a selection of professional and trade publications from the United Kingdom, continental Europe, Australia and New Zealand and a limited number from Asia, Africa and South America, as well as internationally focused peer-reviewed journals and conference proceedings. Language and geographic coverage of TGIF are summarized in Tables 5 and 6.


Table 5. Language coverage of TGIF
Language Total number of TGIF records (through record #65000)* Percent of TGIF records (through record #65000)
English 62,514 96%
German 799 1%
Russian 263 <1%
French 223 <1%
Japanese 194 <1%
Spanish 132 <1%
Other 403 1%
Unknown 565 1%
*Total is greater than 65000 because some documents appear in multiple languages.
Table 6. Serials scanned by TGIF, by geographic focus
Geographic focus Number of serials scanned by TGIF* Percent
International (peer-reviewed journals and proceedings of international conferences) 55 18%
North America 223 71%
United Kingdom 14 4%
Europe (other than United Kingdom) 9 3%
Oceania 7 2%
Asia 3 1%
Africa 1 <1%
South America 1 <1%
Total 313 100%
*as of August 2000

Items listed in TGIF are indexed using the Turfgrass Information Center's Turfgrass Thesaurus, a highly specific controlled vocabulary focused on turfgrass science. About 70% percent of existing records also include abstracts (provided either by the publisher or by the Turfgrass Information Center), full text within TGIF records, or full text on separate web pages directly linked to their TGIF records, and all records being added to the database on an ongoing database are provided with abstracts. TGIF is thus designed to facilitate both keyword (title, abstract and full text) and controlled vocabulary searching.

Comparison of TGIF with Major Agricultural Databases

Method

General search. In an attempt to roughly estimate the number of turf-related articles in the agricultural databases, a search for the keywords turf* or lawn* or golf was performed in each database. (The search term grass was not used in order to avoid confusion with materials on forage grasses, as the term turf(grass) tends to be preferred by authors writing about grasses used in non-agricultural contexts.) This search was intended as a very rough estimate using the means available to researchers in turfgrass science who would be unable to individually examine large numbers of articles.

Subject searches. TGIF and the three agricultural databases were also compared using five turf-related subject searches. In addition to TGIF, the Turfgrass Information Center produces Hot Turf TOPICs, a series of prepared subject bibliographies on turf-related topics of current interest. The numbers of online user requests for each subject bibliography were tabulated and the five most popular subjects selected as test searches, since these were topics of demonstrated interest among real searchers interested in turfgrass science. Keyword searches were composed in an attempt to isolate subject-relevant items in each of the four databases. Attempts were made to keep the searches used in TGIF and those used in the agricultural databases as similar as possible while attempting to eliminate non-turf-related materials from the agricultural database result sets. The subjects and search syntax used in the test searches are listed in Table 7.


Table 7. Search syntax for five turfgrass-related subject searches in two interfaces
Topic TGIF search syntax (using the Turfgrass Information Center's Power Search web-based interface) AGRICOLA, AGRIS, CAB search syntax (using Silver Platter WebSPIRS web-based interface)
Canada geese as a turfgrass pest ((canad* adj (goose or geese)) or (branta adj canadensis)) and year=1990:1999 ((canad* adj (goose or geese)) or (branta adj canadensis)) and (turf* or lawn* or golf) and py=1990-1999
Use of living organisms to control turfgrass pests ((biological adj control*) or biocontrol*) and year=1990:1999 ((biological adj control*) or biocontrol*) and (turf* or lawn* or golf) and py=1990-1999
Control of dollar spot disease of turfgrass ((dollar adj spot*) or (dollarspot*)) and control* and year=1990:1999 (((dollar adj spot*) or (dollarspot*)) and control*) and (turf* or lawn* or golf) and py=1990-1999
Irrigation of turfgrass with sewage effluent (effluent or wastewater or (waste adj water) or (recycled adj water) or (reclaimed adj water) or (sewage adj water)) and irrigat* and year=1990:1999 ((effluent or wastewater or (waste adj water) or (recycled adj water) or (reclaimed adj water) or (sewage adj water)) and irrigat*) and (turf* or lawn* or golf) and py=1990-1999
Algae as a turfgrass pest (algae not (pond* or lake* or ocean or marine)) and year=1990:1999 (algae not (pond* or lake* or ocean or marine)) and (turf* or lawn* or golf) and py=1990-1999

Results of each search were evaluated for relevance to the original subject, using the titles, abstracts and index terms provided by the databases. Items that appeared to mention the subject only in passing and those that were completely irrelevant to turf contexts (e.g. articles about marine algae rather than algae found on turfgrass) were classified as irrelevant. Uncertain cases (e.g., those in the agricultural databases where abstracts were not provided) were decided in favor of relevance. No clear cases of duplicate records within the same result set were found; procedures for dealing with duplicate records (e.g. Hood & Wilson, 1999) were therefore unnecessary. Search precision was measured as the ratio of relevant results to total results from each keyword search (Lancaster & Warner, 1993). Overlap between relevant result sets from TGIF and each of the other databases was also determined, defining overlap as follows (Gluck, 1990):
percent overlap = number of documents in the intersection of the two result sets X 100
number of documents in the union of the two result sets

Indexing of Major Turf-Related Publications. Databases were also compared for their indexing of major turf-related publications, particularly professional and trade publications published by national and international turf organizations. Databases were searched by publication title (for the agricultural databases) or TGIF serial code (for TGIF) for records drawn from 16 major publications.

All TGIF searches were performed using the Turfgrass Information Center's Power Search web-based search interface. All AGRICOLA, AGRIS and CAB searches were performed using Silver Platter's WebSPIRS web-based search interface as made available to the public at Michigan State University.

Results and Discussion

Results of rough general searches in each of the three agricultural databases are presented in Table 8. Though these searches are not meant to be compared directly to the full range of items available through TGIF, the results of the searches demonstrate the relatively small number of articles available to searchers who attempt to limit their agricultural database search results to turf-related items.


Table 8. Estimated numbers of turf-related records in three agricultural databases
Year(s) of publication Number of results of "turf* or lawn* or golf" free text search, limited by year of publication, in specified database* Number of TGIF records (up to R=65000)
AGRICOLA AGRIS CAB
1990-1999 3,464 3,893 3,734 39,011
1995 388 442 405 3,260
1998 397 392 354 4,294
1999 224 80 290 3,821
*searches performed using Silver Platter WebSPIRS web-based interface

Subject searches. As shown in Table 9, TGIF tended to produce considerably larger result sets than the agricultural databases when searching for turf-related topics, while maintaining high precision. TGIF search precision was comparatively high for all searches except the dollar spot disease search ("dollar spot" is a phrase not commonly found outside the turfgrass literature and so produced high precision in all databases). The algae and effluent water searches produced relatively low precision in the agricultural databases due to common non-turf-related uses of these terms (e.g., use of "turf" and lawn" for growths of marine algae and "effluent" for the outflow of water that occurs during various laboratory procedures). The biological control search tended to produce low precision due to variation in usage of the term (e.g., for "natural" pest control methods other than the use of living organisms) and a general tendency to return results focused primarily on chemical, rather than biological, pest control. Overall, TGIF allowed better isolation of relevant, turf-related items, even when using keyword searches rather than TGIF's turf-based controlled vocabulary.


Table 9. Result set size and search precision for turf-related topics in four databases
TGIF AGRICOLA AGRIS CAB
Topic Total results Relevant results Search precision Total results Relevant results Search precision Total results Relevant results Search precision Total results Relevant results Search precision
Canada geese 44 41 .93 4 3 .75 5 4 .80 5 4 .80
Bio-control 680 645 .95 93 52 .56 115 40 .35 168 74 .44
Dollar spot 347 322 .93 24 22 .92 24 23 .96 36 33 .92
Effluent water 181 159 .88 28 22 .79 26 18 .69 60 22 .37
Algae 127 100 .79 12 8 .67 13 9 .69 14 5 .36
Search precision = relevant results/total results for each database.

As shown in Tables 10-12, overlap between TGIF and the major agricultural databases was low for all searches, even taking into account the relatively large result sets produced by TGIF. Items commonly returned by both the agricultural databases and TGIF included scientific journal articles and, for AGRICOLA and AGRIS only, USGA Green Section Record articles. Items commonly returned by the agricultural databases only included non-U.S. publications (particularly for AGRIS and CAB) and agricultural extension service publications from U.S. universities (particularly for AGRICOLA). Items commonly returned by TGIF alone included articles from professional and trade publications, many of which are indexed only by TGIF among the four databases.


Table 10. Percent overlap of topic search results in TGIF and AGRICOLA
Topic Total relevant results returned by both databases Relevant results returned by TGIF only Relevant results returned by AGRICOLA only Relevant results returned by both databases Percent overlap*
Canada geese 43 40 1 2 4.65
Bio-control 673 621 28 24 3.57
Dollar spot 328 306 6 16 4.88
Effluent water 173 151 14 8 4.62
Algae 104 96 4 4 3.85
*Percent overlap = (relevant results returned by both databases/total relevant results returned by all databases) X 100
Table 11. Percent overlap of topic search results in TGIF and AGRIS
Topic Total relevant results returned by both databases Relevant results returned by TGIF only Relevant results returned by AGRIS only Relevant results returned by both databases Percent overlap*
Canada geese 45 40 1 3 6.67
Bio-control 675 625 30 20 2.96
Dollar spot 331 308 9 14 4.23
Effluent water 170 152 11 7 4.12
Algae 105 96 5 4 3.81
*Percent overlap = (relevant results returned by both databases/total relevant results returned by all databases) X 100
Table 12. Percent overlap of topic search results in TGIF and CAB
Topic Total relevant results returned by both databases Relevant results returned by TGIF only Relevant results returned by CAB only Relevant results returned by both databases Percent overlap*
Canada geese 43 39 2 2 4.65
Bio-control 695 621 50 24 3.45
Dollar spot 335 302 13 20 5.97
Effluent water 176 154 17 5 2.84
Algae 105 100 5 0 0.00
*Percent overlap = (relevant results returned by both databases/total relevant results returned by all databases) X 100

Indexing of Major Turf-Related Publications. As shown in Table 13, TGIF has considerably greater coverage of the sixteen major professional and trade publications than the major agricultural databases. Of the sixteen publications studied, only four had significant coverage in any of the agricultural databases.


Table 13. Indexing of several major turf-related professional and trade publications in four databases*
Publication Number of records in TGIF for publication years 1990-1999 Number of records in AGRICOLA for publication years 1990-1999 Number of records in AGRIS for publication years 1990-1999 Number of records in CAB for publication years 1990-1999
Golf Course Management (Golf Course Superintendents Association of America) 1,926 0 0 0
Greenkeeper International (British and International Golf Greenkeepers Association) 403 0 0 0
GreenMaster (Canadian Golf Course Superintendents Association) 215 0 0 0
Grounds Maintenance (Intertec Publishing, Overland Park, Kansas) 1,003 0 1,014 0
International Turfgrass Bulletin/Sports Turf Bulletin (Sports Turf Research Institute, United Kingdom) 294 0 5 12
Journal of Turfgrass Management (Haworth, New York) 60 0 41 55
Journal of Turfgrass Science/Journal of the Sports Turf Research Institute (Sports Turf Research Institute, United Kingdom) 152 0 72 79
Landscape Management (Advanstar Publications, Cleveland, Ohio) 988 0 0 0
New Zealand Turf Management Journal (New Zealand Turf Culture Institute) 375 0 0 0
Rasen (HORTUS, Bonn, Germany) 124 0 77 80
SportsTurf (Sports Turf Managers Association) 354 0 0 0
Turfax (International Sports Turf Institute, College Station, Texas) 84 0 0 0
TurfCraft International (Agricultural Publishers, Melbourne, Australia) 599 0 0 0
Turfgrass Trends (Advanstar Publications, Cleveland, Ohio) 180 0 0 0
TurfNews (Turfgrass Producers International) 526 0 0 0
USGA Green Section Record (United States Golf Association Green Section) 1,403 283 622 6
*As of August 2000.

Conclusions

The Turfgrass Information File (TGIF) appears to offer bibliographic access to a variety of turfgrass-related literature not indexed by other major databases. For researchers interested specifically in turf-related literature, TGIF offers advantages over general agricultural databases both in the amount of information available and in relevance of typical search results. A major advantage of TGIF include its coverage of professional and trade journals, conference proceedings, and university turfgrass research reports, all of which are widely utilized by turfgrass researchers and professionals and largely ignored by agricultural and general-interest databases. TGIF's sole focus on turf-related literature also allows for more precise searching while requiring less effort to produce focused, relevant search results.

One weakness of TGIF in comparison to the major agricultural databases is its relatively limited coverage of non-U.S. publications. However, the number of turf-related, non-U.S. publications found in AGRIS and CAB is so small that international researchers might want to supplement information found in those databases with U.S.-focused information found in TGIF. TGIF also lists relatively few agricultural extension publications from U.S. universities when compared to AGRICOLA. The Turfgrass Information Center does index agricultural extension publications but currently lacks a systematic mechanism for receiving and processing these publications; with greater availability of these publications over the World Wide Web, increasing numbers of these publications may be included in TGIF. Furthermore, the total number of results for turf-related searches in AGRICOLA was not large enough to suggest complete or near-complete coverage of agricultural extension publications in this database. Even taking into account some degree of advantage in the area of agricultural extension publications, the total number of turf-related publications unique to AGRICOLA is small enough that searchers might wish to use it as a supplementary information source when information provided by TGIF is inadequate.

Overall, TGIF provides greater access to turfgrass science literature than other available resources and should be of value to turfgrass researchers and professionals.


References

Cookingham, P. O. (1999). The Turfgrass Information File: Monitoring the turf science literature. In American Society of Agronomy, Crop Science Society of America, & Soil Science Society of America, 1999 Annual Meeting Abstracts. Madison, WI: American Society of Agronomy, 1999.

Gluck, M. (1990). A review of journal coverage overlap with an extension to the definition of overlap. Journal of the American Society for Information Science, 41(1), 43-60.

Golf Course Superintendents Association of America (1997). GCSAA college guide to the golf course management profession. Lawrence, KS: Golf Course Superintendents Association of America.

Hood, W. W., & Wilson, C. S. (1999). The distribution of bibliographic records in databases using different counting methods for duplicate records. Scientometrics, 46(3), 473-486.

Jacso, P. (1992). CD-ROM software, dataware, and hardware: Evaluation, selection, and installation. Englewood, CO: Libraries Unlimited.

Jacso, P. (1997). Content evaluation of databases. Annual Review of Information Science and Technology, 32, 231-267.

Jacso, P. (1998). Analyzing the journal coverage of abstracting/indexing databases at variable aggregate and analytic levels. Library & Information Science Research, 20(2), 133-151.

Lancaster, F. W., & Warner, A. J. (1993). Information retrieval today. Arlington, VA: Information Resources Press.

Thomas, S. E. (1990). Bibliographic control and agriculture. Library Trends, 38(3), 542-561.

Turfgrass Information Center (2000a). Turfgrass Information File (TGIF). http://www.lib.msu.edu/tgif/tgifda.htm (viewed August 22, 2000).

Turfgrass Information Center (2000b). Turfgrass Information File (TGIF) database specifications. http://www.lib.msu.edu/tgif/tgifspecs.htm (viewed August 22, 2000).