[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

IR-L Digest, Vol.XVI, No.37, Issue 473



IRLIST Digest                                       ISSN 1064-6965
October 4, 1999
Volume XVI, Number 37
Issue 473

******************************************************************

  I. QUERIES
        1.
XML or SGML Parser?
        2. Subject Tagging Schema
III. NOTICES
     A. Publications
        1.
Parsers From Around The World
        2. Version 27, Scholarly Electronic Publishing Bibliography
     B. Meetings
        1.
IEEE Vis'99: Early Registration Deadline
        2. CNI/CIC: Working Together Workshop
        3. IRAL '99: CFPARTICIPATION
IV. PROJECTS
     C. Awards, Fellowships, Grants, & Scholarships
        1.
NSF Information Technology Research (ITR):
           Letters of Intent 11/15/99 or 01/05/2K

******************************************************************

I. QUERIES

I.1.
Fr: Fernando AGUIAR <aguiar@emse.fr>
Re: XML or SGML Parser?

Hi,

I was wondering if someone could point me to a XML or SGML parser written in C or C++... Why? I'm developing a search engine to index/search web sites separately. These local search engines will be represented by agents that will cooperate forwarding queries.

I've already written a parser to the robot that crawls the site. But now I need something somewhat more robust.  Constraints :-))) : It must be simple and easy to use and well-documented..

In fact, I took a look at the parser module in libwww (from
www.w3.org), but I admit that I did not understand very well how it works. It seemed rather complicated.

Thanks in advance for your help.
Regards,
Fernando

Fernando AGUIAR tel: + 33 (0) 4.77.42.01.72
Reseaux, Information, Multimedia fax: + 33 (0) 4.77.42.66.66
Centre SIMADE
Ecole des Mines de Saint Etienne e-mail: aguiar@emse.fr
158 Cours Fauriel
42023 Saint Etienne Cedex 2
FRANCE

**********

I.2.
Fr: Steve Rossiter"<SRossiter@email.msn.com>
Re: Subject Tagging Schema

Hello,
In an effort to move towards the paperless office I'm stumped by the need for a system of tagging documents by subject so that future users can find information relevant to their needs. To completely devise from scratch a subject naming system seems inefficient and unlikely to be very useful since tagging by subject is subjective to some degree. Are there any existing systems out there that provide a framework to work within? Off the top of my head I can only think of something like the Dewey Decimal or Library of Congress subject headings but these are rather unwieldy. Perhaps there is software that indexes documents automatically or that can scan them and give suggestions as to the relevancy of its contents to various predefined subjects. Any help on this problem will be appreciated, be it book,
web site, association, or whatever.

Thanks, Steve

******************************************************************

III. NOTICES

III.A.1.
Fr: Karen Smith <smithkar@htdc.org>
Re: Parsers From Around The World

Hello everyone!

I have recently designed a web page that may be of interest to some of you. This web page is a guided tour of parsers of from around the world and it helps you compare them. To my knowledge, nothing like this has ever been done and hopefully it will serve as a reference for those of you working on parsing technology. The site is located at
http://www.ergo-ling.com. If anyone knows of others that should be added to this site, please let me know so I can promptly add them. Please respond personally with any comments or questions you might have concerning this site.

Thank you,
Karen L. Smith
Linguist
Ergo Linguistic Technologies

**********

III.A.2.
Fr: Charles W. Bailey, Jr. <cbailey@UH.EDU>
Re: Version 27, Scholarly Electronic Publishing Bibliography

Version 27 of the Scholarly Electronic Publishing Bibliography is now available. This selective bibliography presents over 1,000 articles, books, electronic documents, and other sources that are useful in understanding scholarly electronic publishing efforts on the Internet and other networks.

HTML: <URL:
http://info.lib.uh.edu/sepb/sepb.html>
Acrobat: <URL:
http://info.lib.uh.edu/sepb/sepb.pdf>
Word 97: <URL:
http://info.lib.uh.edu/sepb/sepb.doc>

The HTML document is designed for interactive use. Each major section is a separate file. There are live links to sources available on the Internet. It can be can be searched using Boolean operators.

The HTML document also includes Scholarly Electronic Publishing
Resources, a collection of links to related Web sites:

<URL:
http://info.lib.uh.edu/sepb/sepr.htm>

The Acrobat and Word files are designed for printing.
Each file is over 250 KB.
(Revised sections in this version are marked with an asterisk.)
Table of Contents
1 Economic Issues*
2 Electronic Books and Texts
2.1 Case Studies and History*
2.2 General Works
2.3 Library Issues
3 Electronic Serials
3.1 Case Studies and History
3.2 Critiques
3.3 Electronic Distribution of Printed Journals
3.4 General Works*
3.5 Library Issues*
3.6 Research*
4 General Works*
5 Legal Issues
5.1 Intellectual Property Rights*
5.2 License Agreements*
5.3 Other Legal Issues
6 Library Issues
6.1 Cataloging, Classification, and Metadata*
6.2 Digital Libraries*
6.3 General Works*
6.4 Information Conversion, Integrity, and Preservation*
7 New Publishing Models*
8 Publisher Issues*
8.1 Electronic Commerce/Copyright Systems
Appendix A. Related Bibliographies by the Same Author
Appendix B. About the Author

Best Regards,
Charles
Charles W. Bailey, Jr., Assistant Dean for Systems,
University Libraries, University of Houston, Houston, TX
77204-2091. E-mail: cbailey@uh.edu. Voice: (713) 743-9804.
Fax: (713) 743-9811.
<URL:
http://info.lib.uh.edu/cwb/bailey.htm>
<URL:
http://info.lib.uh.edu/sepb/sepb.html>

**********

III.B.1.
Fr: Dirk Bartz  <bartz@gris.uni-tuebingen.de>
Re: IEEE Vis'99: Early Registration Deadline

Second C A L L  F O R  R E G I S T R A T I O N
Early registration deadline is coming up

Vis99 IEEE Visualization 1999
Celebrating Ten Years
Call for Participation
October 24 - October 29, 1999
San Francisco Airport Hyatt
San Francisco, California
http://www.erc.msstate.edu/vis99
http://davinci.informatik.uni-kl.de/vis99/
THE INSTITUTE OF ELECTRICAL & ELECTRONICS ENGINEERS, INC.
IEEE COMPUTER SOCIETY
Sponsored by the
IEEE Computer Society Technical Committee on Visualization and Graphics
In Cooperation with ACM/SIGGRAPH

WELCOME TO IEEE VISUALIZATION '99
Welcome to the 10th annual IEEE Visualization conference! Please join us in our celebration of 10 years of leading edge research in scientific visualization by returning to the city where we started, San Francisco, California. In the past decade the field of scientific visualization and the IEEE visualization conference have grown in both size and sophistication. Reflecting this growth, our keynote talk will describe a series of workshops by the leaders in the scientific visualization field, resulting in "Data and Visualization Corridors". This talk will lay out some of the greatest problems facing our field today and strategies for their solution. Our capstone speaker, Rick Stevens, looks into the future with a presentation of "ActiveSpaces: The Access Grid, ActiveMural and Advanced Visualization
Systems". Between these two talks you will find reports on the state of the art in data visualization, including new research, case studies and vociferous and opinionated panels. You will also be able to see and try the new technologies close up in the demonstration area and the Creative Applications Lab (CAL). Preceding the main conference on Wednesday through Friday are the Sunday through Tuesday sessions. In these sessions you will find several tutorials by leaders in the field, providing you with an opportunity for learning the background of their research, as well as studying their new work in depth. You will also find two symposia, the symposium on information visualization, where techniques for the visualization of abstract information will be studied, and the symposium on parallel visualization and graphics, where you will see new work in the application of parallel computer systems to our field. Our 10th conference celebration will be loudest at the Wednesday night banquet, where we will have special entertainment as well as recognition of those who started the visualization conference and nurtured it to its current success. Please join us in San Francisco, California October 24 through 29, to make this celebration memorable at the 10th visualization conference!

Steve Bryson, NASA Ames Research Center
Theresa-Marie Rhyne, Lockheed Martin/U.S.,
EPA Scientific Visualization Center
IEEE Visualization '99 Conference Co-Chairs
Please check the web pages at
http://WWW.ERC.MsState.Edu/conferences/vis99/advanceprogram/registration.htm
or
http://davinci.informatik.uni-kl.de/vis99/advanceprogram/registration.htm
for details on registration.

Sunday / Monday / Tuesday Registration
You may attend any tutorial or symposium on the day(s) registered. Registration includes Parallel Visualization and Graphics proceedings, Information Visualization proceedings, Monday reception, Wednesday Keynote, Wednesday-Thursday demonstrations, the conference videotape, and CD. All tutorial, symposia, and conference proceedings and notes are on the Vis99 CD.

Early Registration Late/Onsite Registration (received by Oct. 1) (Oct. 2 or later)

**********

III.B.2.
Fr: Joan K Lippincott <joan@cni.org>
Re: CNI/CIC: Working Together Workshop

CNI and the CIC announce Working Together: A Workshop for Archivists, Records Managers and Information Technologists, which will be held on Thursday and Friday, November 18-19, 1999 at the University Place Conference Center at IUPUI in Indianapolis, IN.

In an intensive, participatory workshop, archivists, records managers, and information technologists will look at the issues of digital preservation and access, and discuss the management of electronic records in their own institutional context. Sessions will include presentations, small group work, and a keynote by CNI's Executive Director Clifford Lynch.

The emphasis of this workshop is on teamwork -- bringing together teams of archivists, records managers, and information technologists who will begin to develop practical plans for electronic records management that can be implemented at their own institutions. Librarians with archive or records management responsibilities are also eligible and encouraged to participate as part of an institutional team.

Gerry Bernbom, Indiana University and Fynnette Eaton,
Smithsonian Institution Archives, will facilitate the workshop.

PARTICIPANTS MUST REGISTER AS INSTITUTIONAL TEAMS - one or more archivists or records managers and one or more information technologists per institution. Each team member must register separately via the website at
www.cni.org. REGISTRATION IS LIMITED to 40 participants and will be on a first-come, first-served basis. The REGISTRATION DEADLINE is OCTOBER 15.

For additional information, contact Joan Lippincott at
joan@cni.org or 202-296-5098.

Joan K. Lippincott, Associate Executive Director
Coalition for Networked Information
21 Dupont Circle, N.W.
Washington, D.C. 20036
(202) 296-5098
FAX: (202) 872-0884
Internet: <joan@cni.org>
<
http://www.cni.org/>

**********

III.B.3.
Fr: Lee-Feng Chien <lfchien@iis.sinica.edu.tw>
Re: IRAL '99: CFPARTICIPATION

CALL FOR PARTICIPATION
The Fourth International Workshop on
Information Retrieval with Asian Languages - IRAL'99
11-12 November, 1999
Academia Sinica, Taipei, Taiwan

Sponsored by
Institute of Information Sciences, Academia Sinica, Taiwan
Association for Computational Linguistics and Chinese Language Processing
URL: http://www.iis.sinica.edu.tw/~IRAL99/

ABOUT THE WORKSHOP
The purpose of the IRAL workshop is to bring together researchers and developers who are interested in exchanging new ideas and presenting results in the field of information retrieval (IR), with an emphasis on the issues related to Asian languages and multilingual applications. The first International Workshop was held in 1996 in Taejon, Korea, with the name "Information Retrieval with Oriental Languages". The second and the third workshops were held in Japan and Singapore in 1997 and 1998 respectively. To further facilitate interchange between the information retrieval and digital library communities, the fourth workshop will be held in Academia Sinica Taipei on November 11-12, 1999 directly after the Second Asian Digital Libraries Conference (
http://www.lis.ntu.edu.tw/adl99/). For more details, please see web site at http://www.iis.sinica.edu.tw/~IRAL99/

ABOUT THE BIG EARTHQUAKE
As you might know that there occurred a big and damaging earthquake on Sept. 21 in the middle area of the Taiwan island. Thank you for great help from people of the island and the world; Taipei has apparently got back to normal. In fact, the most damaged area was in middle Taiwan, and there was only one building in the Taipei city crushed in the earthquake. The conference location, the campus of Academia Sinica, which is not far from the center of Taipei, also has no damage. The IRAL'99 organization committee therefore believes it will have no problem organizing the conference as the original program below.

PRELIMINARY PROGRAM
November 11 (Thursday)
08:30 - 09:00 Registration
09:00 - 09:20 Opening Ceremony, Der-Tsai Lee, Academia Sinica, Taiwan
09:20 - 10:20 Keynote Speech: "Global Access to Multilingual Information", Prof. Douglas W. Oard, U. of Maryland, USA
Session Chair, Lee-Feng Chien, Academia Sinica, Taiwan
10:20 - 10:50 Coffee Break
10:50 - 12:20 Session I: Summarization, CLIR and Link-based Retrieval
Session Chair: Kam-Fai Wong, Chinese University, Hong Kong
1. "A Summarization System for Chinese News from Multiple Sources",
Hsin-Hsi Chen, Sheng-Jie Huang, National Taiwan University, Taiwan
2. "Term Similarity-Based Cross-Language Document Retrieval and
Summarization",Mirna Adriani, C.J. van Rijsbergen, University of Glasgow, Glasgow, Scotland
3. "Improving Efficiency with Document Category Information in Link-based Retrieval",Jeong-Mook Lim, Hyo-Jung Oh, Sung Hyon Myaeng, Mann-Ho Lee, Chungnam National Univ., Korea
12:20 - 13:30 Lunch
13:30 - 15:00 Session II: Text Retrieval (I)
Session Chair: Sung-Hyon Myaeng, Chungnam Nat. Univ., Korea
1. "A Text Data Mining Approach Using a Chinese Corpus Based on
Self-Organizing Map", Chung-Hong Lee, Hsin-Chang Yang, Chang Jung
University, Taiwan
2. "An Efficient Storing Method of Compound Keywords Using Extendible
Hashing", Hisatoshi Mochizuki, Yoshitaka Hayashi, Osaka Prefectural
College of Technology and SUMITOMO METAL Systems Development Co.. Japan
3. "Coding Grams of Variable Length into Fixed Bytes - For Fast Full Text Retrieval with Compact Indices", Takashi SATO, Kazuo SUGIHARA,
Susumu YOSHITAKE, Osaka Kyoiku University, Japan
15:00 - 15:30 Coffee Break
15:30 - 16:00 Invited Speech
Session Chair: Mei-Mei Wu, Nat. Taiwan N. Univ
"The NTCIR Workshop : the first evaluation workshop on Japanese text retrieval and cross-lingual information retrieval", Noriko Kando, Kazuko Kuriyama, Toshihiko Nozue, Koji Eguchi, Hiroyuki Kato, Soichiro Hidaka, Jun Adachi, National Center for Science Information Systems, Japan
16:30 - 18:00 Session III: Multimedia IR
Session Chair: Hsinchun Chen, Univ. of Arizona, USA
1. "Semantic Analysis of Television News Captions Referring to Suffixes", Ichiro IDE, Reiko HAMADA, Shuichi SAKAI, Hidehiko TANAKA, University of Tokyo, Japan
2. "Structural Analysis of Preparation Steps on Supplementary Documents on Cultural TV Programs", Reiko HAMADA, Ichiro IDE, Hidehiko TANAKA,
University of Tokyo, Japan
3. "Retrieval of Mandarin Spoken Documents Based on Syllable Lattice
Matching", Hsin-min Wang, Academia Sinica, Taiwan
4. "Weight Assignment in Dissimilarity Function for Chinese Cursive Script Character Image Retrieval Using Genetic Algorithm", David Yuk-Ming Chan, Irwin King, The Chinese University of Hong Kong, Hong Kong
18:30 - 20:30 Reception

November 12 (Friday)
09:00 - 10:00 Keynote Speech:
Prof. Hiroya Fujisaki,Science University of Tokyo, Japan
Session Chair: Lin-Shan Lee, NTU, Taiwan
10:00 - 10:30 Coffee Break
10:30 - 12:20 Session IV: NLP for IR
Session Chair: Hsin-Hsi Chen, Nat. Taiwan University, Taiwan
1. "Using syntactic information in handling natural language queries for extended Boolean retrieval model", Geunbae Lee, Mihwa Park, Hyungsuk Won, Pohang University of Science & Technology, Korea
2. "Extraction of Time-related Concepts for Intelligent Information Retrieval in Chinese", Wenjie Li, Wai-Ip Lam, Kam-Fai Wong, The Chinese
University of Hong Kong, Hong Kong
3. "PAT-tree-based Online Corpus Collection and Classification",
Chun-Liang Chen, Lee-Feng Chien, National Taiwan University and
Academia Sinica, Taiwan
12:00 - 13:30 Lunch and PC Meeting
13:30 - 15:00 Session V- Term Extraction and Analysis
Session Chair: Jun Adachi, NACSIS, Japan
1. "Measuring Representativeness of Terms",
Toru Hisamitu, Yoshiki Niwa, Jun-ichi Tsujii, Hitachi, Ltd. and University of Tokyo, Japan
2. "An Efficient Method of Determining Relationships among Compound Keywords Using Machine-AC", Yoshitaka Hayashi, Hisatoshi Mochizuki, SUMITOMO METAL Systems Development Co. and Osaka Prefectural College of Technology, Japan
3. "Detecting Japanese Term Variation in Textual Corpus ", Fuyuki Yoshikane, Keita Tsuji, Kyo Kageura, Christian Jacquemin, University of Tokyo, Japan
15:00 - 15:30 Coffee Break
15:30 - 17:30 Session VI: Text Retrieval (II)
Session Chair: Geunbae Lee, POSTECH, Korea
1. "A Document Similarity Measurement without Dictionaries",
Chung-Chen Chen, Wen-Lian Hsu, Academia Sinica, Taiwan
2. "A Hybrid Indexing Strategy using Words and Bigrams",
T.F.Tsang, Robert W.P. Luk, K.F. Wong, Hong Kong Polytechnic
University, Hong Kong
3. "Efficient Building and Querying of Asian Language Document Databases", Phil Vines, Justin Zobel, RMIT University, Australia
4. "Text Structure Analysis as a Tool to Make Retrieved Document Usable", Noriko Kando, National Center for Science Information Systems, Japan
19:00 - 21:00 Banquet

Online Registration
To register, please use the on-line registration system at
URL:
http://www.iis.sinica.edu.tw/~IRAL99. Please print out the completed form generated by the system with signature and then fax (or send) to:
Miss Jessie Huang (IRAL'99 Registration)
Institute of Information Science, Academia Sinica N0. 130, Academia Sinica Road, Sec.2, Nankang Taipei, Taiwan 115
Fax: 886-2-2788-1638
Note that the organizing committee only charges the registration fee via credit cards and deposit for room reservation is not required.

CONFERENCE ORGANIZATION
GENERAL CHAIR:
Der-Tsai Lee, Academia Sinica, Taiwan
GENERAL CO-CHAIR:
Lin-Shan Lee, National Taiwan University, Taiwan

******************************************************************

IV. PROJECTS

IV.C.1.
Fr: Maria Zemankova <mzemanko@nsf.gov>
Re: NSF Information Technology Research (ITR):
    Letters of Intent 11/15/99 or 01/05/2K

Information Technology Research (ITR), NSF 99-167
http://www.nsf.gov/cgi-bin/getpub?nsf99167

DEADLINE DATES:
Proposals WITH BUDGETS OVER $500K
LETTERS OF INTENT (Required): NOVEMBER 15, 1999
PRE-PROPOSALS: JANUARY 5, 2000
FULL PROPOSALS: APRIL 17, 2000
PROPOSALS WITH BUDGETS NOT EXCEEDING $500K
LETTERS OF INTENT (Required): JANUARY 5, 2000
FULL PROPOSALS: FEBRUARY 14, 2000

Short Description/Synopsis of Program:
Over the past decade, the Federal Government has provided crucial investments in information science and engineering, such as the High Performance Computing and Communications (HPCC) Program and the Partnerships for Advanced Computational Infrastructure (PACI). The recent report of the President's Information Technology Advisory Committee (PITAC) noted the important contribution this investment has made, but cautioned that federal support for long-term research on information technology has been "dangerously inadequate." This report has led the Administration to seek additional funding research in information technology. Congressional bill HR 2086 also calls for NSF to make grants for "long-term basic research on networking and information technology, with priority given to research that helps address issues related to high end computing and software and network stability, fragility, reliability, security (including privacy), and scalability."

This solicitation requests proposals for fundamental research in information technology, encouraging in particular research spanning information technology and scientific applications, and in the area of social, ethical and workforce issues. Pending availability of funds, a separate solicitation will be issued for a terascale computer facility for high-end science and engineering.

NSF encourages multi-disciplinary and multi-institutional proposals as well as proposals with international cooperation. NSF has particular responsibility to address educational and community-extending activities. Thus NSF expects that proposals will address and include one or more community-extending concepts such as undergraduate education or links to minority serving institutions, or institutions in EPSCOR states.

CISE Cognizant Program Officer:
Michael Lesk, Division Director, Information and Intelligent Systems (IIS),
Computer and Information Science and Engineering (CISE)
Telephone 703-306-1930, e-mail: mlesk@nsf.gov.

******************************************************************

IRLIST Digest is distributed from the University of California, California Digital Library, 1111 Franklin Street, Oakland, CA. 94607-5200.

Send subscription requests and submissions to: nancy.gusack@ucop.edu

Editorial Staff:
Nancy Gusack nancy.gusack@ucop.edu
Cliff Lynch (emeritus) cliff@cni.org

The IRLIST Archives is set up for anonymous FTP. Using anonymous FTP via the host hibiscus.ucop.edu, the files will be found in the directory /data/ftp/pub/irl, stored in subdirectories by year (e.g., data/ftp/pub/irl/1993).

Search or browse archived IR-L Digest issues on the Web at:
http://www.dcs.gla.ac.uk/idom/irlist/

These files are not to be sold or used for commercial purposes. Contact Nancy Gusack for more information on IRLIST. THE OPINIONS EXPRESSED IN IRLIST DO NOT REPRESENT THOSE OF THE EDITORS OR THE UNIVERSITY OF CALIFORNIA. AUTHORS ASSUME FULL RESPONSIBILITY FOR THEIR MATERIAL.