Go GovStat Home!!
Go GovStat Home!!
 


[Projects]
Faster | Nesstar | Addsia | Mission | Metanet | I2T/Ferrett | VDC | IMDB

FASTER (Flexible Access to Statistics, Tables and Electronic Resources)
http://www.faster-data.org/index.htm

About FASTER
FASTER will create an easy to use and flexible tool for access to official and other statistical data. It will do this by developing an environment, incorporating components of existing systems wherever possible, that takes advantage of recent developments in metadata, client side functionality and Web security and statistical disclosure control.
FASTER is supported by the European Commission under the auspices of the Fifth Framework Programme

Metadata

Statistical metadata is at the core of the FASTER project as developments in metadata within the academic social science and official statistics communities are combined.

Technology
FASTER will make extensive use of XML and RDF in the development of a three tier architecture based on the outcome of two metadata workshops in 2000.

NESSTAR (Networked Social Science Tools and Resources)
www.nesstar.org/

About NESSTAR
NESSTAR has been supported by the European Commission in the Telematics Applications Programme of the IVth Framework and the Information Society Technologies programme of the Vth Framework via the FASTER project.

Papers and presentations
NESSTAR: A Semantic Web Application for Statistical Data and Metadata
Brief introduction to the NESSTAR system and NEOOM. A revised version of a paper presented at the Real World RDF and Semantic Web Applications Workshop, WWW2002 Conference, May 2002.
http://www.nesstar.org/sdk/nesstar2002.pdf

An Infrastructure for Data Dissemination via the Internet
http://www.nesstar.org/papers/NesstarOverview.ppt

top

ADDSIA (Access to Distributed Databases for Statistical Information and Analysis)
http://homepages.ed.ac.uk/addsia/

ADDSIA is a multi national reseach project funded by the European Union. It aims to use distributed database techniques and World Wide Web (WWW) technology in order to facilitate more effective access to statistical data by Europe's research and policy community.

Project Summary (From: http://www.ed.ac.uk/ces/Projects/addsia.htm):
The main concept of ADDSIA is to allow aggregated data from different data sources to be passed to a central location and merged using statistical algorithms which take into account the characteristics (defined by metadata) of the system. This project formally ended in March 2000, after a three-month extension. The final year of the project was spent in consolidating the code developed by the partners. The overall approach of ADDSIA assumed a hierarchical situation where Data Providers would supply information on their data to a Domain manager. While we were successful in defining the architecture and developing the framework for capturing data and metadata, we felt that the approach was too restrictive. Consequently we developed the MISSION architecture and this proposal was successful. The new project started in January 2000, and incorporates ideas and code from ADDSIA

Papers and presentations
Metadata and XML for Organizing and Accessing Multiple Statistical Data Sources
http://www.asc.org.uk/Events/Sep99/Pres/yaxin.ppt

Bi, Y., Murtagh, F. (1998). The Roles of Statistical Metadata and XML in Structuring and Retrieving Statistical Information. Proc. of NTTS(New Techniques & Technologies for Statistics)'98.
http://europa.eu.int/en/comm/eurostat/research/conferences/ntts-98/papers/cp/011c.pdf

 

MISSION (Multi-Agent Integration Of Shared Statistical Information Over The (Inter)Net)
http://www.epros.ed.ac.uk/mission/

MISSION aims to provide a modular system of software which will enable providers of official statistics to publish their data in a unified, and unifying, framework, and to allow consumers of statistics to access these data in an informed manner with minimum effort.

Project Summary (From: http://www.ed.ac.uk/ces/Projects/mission.htm)
The main goal of this project is to utilise the World Wide Web and emerging agent based technologies to provide a modular system of software which will enable providers of official statistics to publish their data in a unified framework, and to allow consumers of statistics to access these data in an informed manner with minimum effort. The project is progressing well within the proposed timeplan. Several deliverables have been completed: presentation material, initial technical implementation plan, public web site and the synthesised description of user needs. In addition, a private web site has been set up to aid communication within the consortium and procedures have been agreed for communication and peer reviewing. There have been three successful project meetings as well as a more technical oriented one. Further, there has been good progress, using the above-named procedures, in the formal specification of the system and in agreement on the architecture and system integration. Planning for the first test in February 2001 is now underway.

Papers and presentations
Yaxin Bi and Joanne Lamb(2001). Facilitating Integration of Distributed Statistical Databases Using Metadata and XML. Proc. of ETK&NTTS 2001.
http://webfarm.jrc.cec.eu.int/etk-ntts/Papers/final_papers/en187.pdf

top

METANET
http://www.epros.ed.ac.uk/metanet/

MetaNet - A network of excellence for harmonising and synthesising the development of statistical metadata - is a part of the European Union Fifth framework Research and Development program. Specifically it is part of the Information Society Technology strand, number IST-1999-29093.

Project Summary (From: http://www.ed.ac.uk/ces/Projects/metanet.htm)
MetaNet will be a network consisting of experts and users from NSIs, users of official statistics, researchers and developers to consolidate the work on metadata models that has been carried out in NSIs, in the fourth framework, in the Eurostat Supcom projects as well as in current work in fifth framework projects.

I2T / FERRETT

Census Bureau's DataFerrett
http://dataferrett.census.gov/TheDataWeb/index.html
DataFerrett supports metadata searches across surveys, on-the-fly variable recoding, complex tabulations, and graphics. DataFerrett is working to promote interoperability with the DDI format.

Putting Government Information at Citizens' Fingertips. Envision 16(3).
http://www.npaci.edu/envision/v16.3/dice.html
Database Design and Data Loading
http://www.sdsc.edu/~baru/dg_2000.ppt

top

VDC
The Virtual Data Center Project: An Operational Social Science Digital Data Library
http://thedata.org/index.shtml

An operational, open-source, digital library to enable the sharing of quantitative research data, and the development of distributed virtual collections of data and documentation.

[from the Introduction page]
The demand for social science data exists, and will only grow with easier availability. The use of data to researchers is obvious, but students and citizens also need to access to data if they are to understand the world and the issues of public policy that the nation faces. They also need to understand data to manage their own lives effectively - whether that entails managing their health or their money. Our project will bring social science data closer to students in elite universities and in community colleges, and closer to citizens through public libraries.
Under a grant from the Digital Libraries Initiative - Phase 2, we are developing the Virtual Data Center (VDC), an instrument to manage and share numerical social science data easily for teaching and research purposes across multiple institutions.

Integrated Meta Data Base (IMDB)

[from http://www.statcan.ca/english/freepub/11-533-XIE/about.htm]
Integrated metadata - Statistics Canada has developed an Integrated MetaData Base (IMDB) to provide a central repository for qualitative information regarding statistical programs at Statistics Canada. It integrates many existing repositories and provides new features such as:

  • A direct link between on-line services such as CANSIM, Canadian Statistics and the Online Catalogue and the meta data pertaining to the statistical program that provided the information
  • Presently metadata pertains to survey-level information such as a general description of the survey and the methodological procedures used to collect, verify and analyze data. This includes sampling plans, data capture, error detection, estimation methods, time series processes and disclosure control methods. Data quality measures such as response rate, coverage error, imputation and sampling error will also be described. Sampling error will be expressed as coefficients of variation. It is also possible to directly access full-length documents such as data quality reports and user guides.
  • Metadata is now kept up to date with each data release. Survey managers update the existing metadata to provide information that pertains to the most recent data release. An image of the questionnaire used to collect the data will also be provided.

The next phase of the project will include detailed metadata about variables being measured.

Papers and presentations
Johanis, P. (2001). Role of the Integrated Metabase at Statistical Canada
http://www.statcan.ca/english/conferences/symposium2001/session21/s21c.pdf
[abstract by the author]
The Integrated Metadatabase is a corporate repository of information on each of Statistics Canada¡¯s surveys. This information includes a description of data sources and methodology, definitions of concepts and variables measured and indicators of data quality. It provides an effective vehicle for communicating data quality to data users. Its coverage is exhaustive of Statistics Canada¡¯s data holdings, the information on data quality provided complies with the Policy in Informing Users of Methodology and Data Quality and it is presented in a
consistent and systematic fashion.