Notes
Slide Show
Outline
1
Open Video Project Overview
  • INLS 235
  • Spring 2003
2
Open Video Project
  • Goals
    •  Create an open source DL for use by researchers, students, and the public.
    • A testbed for interactive interfaces
    • An environment for building theory of human information interaction
  • Ongoing work: begun 1995 with colleagues at UMD
  • Current funding: NSF# IIS-0099538, NCNI
  • Collaborators/Contributors: I2-DSI, ibiblio, CMU, UMD, NIST, Internet Archive, NASA
  • www.open-video.org
3
 
4
Current Status
  • ~ 0.5 TB of content
  • ~2000 video segments
  • ~1200 different titles
  • ~1800 unique visitors per month
  • I2-DSI video channel
  • OAI provider
  • Ongoing user studies
5
 
6
Backend Tools and Services
  • Workstations, servers, disk arrays
  • Tape players (VHS, Beta SP), digitization boards (e.g., Broadway), and software for AVI/MOV to MPEG-1, MPEG-2, and QuickTime
  • Bandwidth (UNC-CH switched ethernet)
  • Linux OS, PHP scripting language, MySQL DBMS, Apache server
7
Backend Tools and Services (cont’)
  • Merit (UMCP UMIACS), ported to Linux to extract candidate keyframes
  • Speech to text (e.g., Sphinx at CMU)
  • VAST keyframe/posterframe extraction, selection, and management
  • Transaction logs and scripts (for evaluation and for recommenders)
  • Peer to peer exchange
  • ISEE (shared remote video use, e.g., DE)
  • Indexer workstation


8
Tools and Services for User Studies
  • Database driven web pages for user interaction
  • Usability workstation (multiple camera, mixer, VCR)
  • eye tracking system
  • Speech synthesis (for audio keywords)


  • Java and Perl scripts for managing, moving files, managing server (security, upgrades, etc.)
9
Agile Views Interface
  • Provide a variety of access representations (e.g., indexes) and control mechanisms
  • Usual search and browse capabilities
  • Leverage both visual and linguistic cues
  • Create and test surrogates for overview and preview
10
Browse: by Categories & Attributes
11
Search: by Category & Attribute
12
Search: by Free Text & Keyword
13
Search Results
14
Segment Details
15
Video Transcript Text
16
Video Segment Preview
17
AgileViews Overview – Genre: Documentary
18
AgileViews Overview – Genre: Education
19
AgileViews Overview – Color/B&W
20
Previews
21
Agile Views Preview – Faces
22
Agile Views Preview – Faces
23
Agile Views Preview – Superimposition
24
Agile Views Preview – Brightness
25
User Study Research Agenda
26
Exploratory Study
  • What are the strengths and weaknesses of different surrogates from the users’ perspective?
  • Are any of the surrogates better than the others in supporting user performance?
27
The Surrogates
  • Storyboard with text keywords (20-36 per board@ 500 ms)
  • Storyboard with audio keywords
  • Slide show with text keywords (250ms repeated once)
  • Slide show with audio keywords
  • Fast forward (~ 4X)
28
Method
  • 7 video segments (2-10 min), 5 surrogates created for each
  • 10 subjects with high video and computer experience
  • Three phases (all multi-camera videotaped)
    • View full video then use 3 surrogates, repeat
      • Participant observation and debriefing
    • Do NOT view full video, use 3 surrogates, repeat
      • Participant observation and debriefing
    • Complete 3 assigned tasks with surrogates of choice
      • Think aloud and debriefing
  • http://www.open-video.org/experiments/chi-2002/methods/study1.mov



29
Tasks
  • Gist determination—free text
  • Gist determination—multiple choice
  • Object recognition—textual
  • Object recognition—graphical
  • Action recognition (2-3 second clips)
  • Visual gist (predict which frames belong)
    • http://www.open-video.org/experiments/chi-2002/surrogates/index.html


30
Preferences
  • In debriefing after each phase, subjects asked about preferences.
  • Some preferences changed over the phases
  • 2 subjects preferred ff
  • 4 subjects said ff if audio keywords added
  • 1 storyboard with audio keywords
  • 2 slide show with audio keywords
  • à drop ss with text keywords, develop ff
31
Performance
  • No SRD on gist (both free text and multiple choice)
  • SRD on action recognition favoring ff
  • ‘Near’ SRD on text object recognition favoring SB/w audio keywords
  • 4:1 to 29:1 compaction rates suitable for tasks
  • Psychometric and face validity support for the tasks (means and variances; relevant to real tasks)
  • SRD in gist and visual gist for one video
    • àHomogeneity of frames diminishes surrogate value
    • àKeywords help when visual variability decreases
32
Qualitative Results
  • Subjects suggested different surrogates for different tasks (e.g., ff for judging kid safe, sb for identifying images, ff for video styles)
  • Three senses of gist
    • Topic (T)
    • Narrativity (N)
    • T+N+visual style
  • Individual preferences and experiences influence surrogate effectiveness




33
Fast Forward Study
  • How fast can we make fast forwards?
    • 4 ff conditions (32X, 64X, 128X, 256X)
    • Four video segments for each condition
    • 45 subjects
    • 6 tasks (full text gist, multiple choice gist, word object recognition, graphical object recognition, action recognition, visual gist)
34
Preliminary Results
  • SRD on 4 of 6 tasks as speed increases, however, reasonable performance at even the highest rate
  • Video content/genre interacts with performance
  • Preference does not parallel performance (people can perform well under extreme conditions but do not like/enjoy)
  • àGive users control but select appropriate defaults
35
Next Steps
  • Poster frame and keyword placement effects using eye-tracking
  • Integrate surrogates into production system
  • User studies with overall system
  • New tools
    • Shared video study environment (ISEE)
    • Peer to peer sharing
    • Indexer’s toolkit
    • Audio??
  • Continue to build and sustain Open Video
36
Summary
  • Give people many ‘views’ to look ahead
  • Make these views easy to manipulate (agile)
  • Challenges
    • Mapping video characteristics to surrogates (e.g., keyframes, keywords), mapping surrogates to control mechanisms (e.g., mouse actions)
    • Automating production processes