Notes
Outline
Understanding Statistical Concepts and Terms in Context: The GovStat Ontology and the Statistical Interactive Glossary
Stephanie W. Haas, Maria Cristina Pattuelli,
 Ron T. Brown, Jesse Wilbur
School of Information and Library Science
University of North Carolina at Chapel Hill
{haas, browr, pattm}@ils.unc.edu, jdwilbur@email.unc.edu
http://ils.unc.edu/govstat
Supported by NSF grant EIA 0131824
The GovStat Project
Find what you need,
understand what you find
Government statistical agencies are a rich source of information, but many barriers  to their use, especially by non-experts, exist.
Statistical Knowledge Network
integrate information across agency boundaries
provide alternate means of finding and viewing information
provide help when, where, and how it is most useful
Slide 3
Barriers to Using Statistical Information
Lack of statistical knowledge leads to difficulties:
searching
recognizing relevant statistics
choosing the most appropriate statistics
understanding what is presented
using and interpreting the information
Examples from
Metadata User Study
Why are there so many numbers for employment and unemployment?  Does it matter which I use?
The table lists mean and median income – which number is right?
I’ll use the seasonally unadjusted number, because I just want one number, not a set for each season of the year.
The “E” means it’s an estimate – they don’t know the real number.
Can I compare a seasonally adjusted number to an unadjusted number?
Basic Principles
User’s purpose is to find information, not learn more about statistics
Seeking help shouldn’t mean abandoning the task
Help should be integrated as seamlessly as possible into the statistical resources
Design Guidelines for SIG
and the GovStat Ontology
Limited coverage of concept and terms
Intended for “everyday users”, not experts
Provide explanations in variety of forms
Explanations must be attractive
Explanations should be coordinated with content – context specificity
SIG Framework
Terms
Frequently encountered terms
Basic level of statistical literacy
Strategies for term identification
examination of frequently-visited pages
anecdotal evidence from agency and non-agency consultants
metadata user study
web crawl of agency sites
Content
Basic level of explanation
Variety of delivery “packages”
definition, example
brief tutorial, demonstration
interactive simulation
combination
May incorporate related terms and concepts
May point to more advanced resources
Content
Outline
Context Specificity
Incorporate explanations in user’s work context
table- or statistic-specific, e.g., CPI, death rate, gasoline prices, etc.
agency- or concept-specific, incorporating relevant entities, e.g., employment, benefits, height and weight, etc.
general, universal context
Slide 13
User Control
Minimal interruption to user’s task
Inform user that help is available for a term
Allow user to select help presentation(s)
Allow user to control action
start, stop, repeat, supply values, etc.
Allow user to explore related terms (ontology)
GovStat Ontology
Scope of glossary plus connecting concepts and relationships
Taxonomic relationships support
provision of context-specific or more general explanations
concept explanation templates
Domain relationships support
combining related concepts in explanation
integration business rules
Future Directions
Evaluation of ontology and SIG
User exploration of statistical concepts and terms in ontology
Connections to GovStat Metadata Model
Services for information integration business rules
Extending context-specific help
domain terms for statistical agencies
other arenas
Slide 17