Living In the KnowlEdge Society: the double duty of a librarian, by Edward A. Fox (fox@vt.edu, http://fox.cs.vt.edu), Dept. of Computer Science, Virginia Tech. TPDL 2011 Workshop, 28-29 September, Berlin.
Slide 1
Linking Research and Education in Digital Libraries
TPDL 2011 Workshop, 28-29 September, Berlin
“Living In the KnowlEdge Society: the double duty of a librarian”
by Edward A. Fox
fox@vt.edu
http://fox.cs.vt.edu
Dept. of Computer Science, Virginia Tech
Blacksburg, VA 24061 USA
Slide 2: Outline
- Acknowledgments
- Digital libraries, 5S
- LIKES, CTRnet
- Curricular efforts
- Book efforts
Slide 3: Acknowledgements
- Mentors (Licklider, Kessler, Salton)
- Virginia Tech, CS, Digital Library Research Lab
- NSF and other sponsors
- Students, colleagues, co-investigators
  - Monika Akbar, Yinlin Chen, Marcos André Gonçalves, Doug Gorton, Tarek Kanan, Nadia Kozievitch, Spencer Lee, Jonathan Leidig, Yi Ma, Uma Murthy, Sung Hee Park, Rao Shen, Venkat Srinivasan, Ricardo Torres, Xiaoyan Yu, ...
  - Barbara Wildemuth, Jeffrey Pomerantz, Sanghee Oh, Seungwon Yang
Slide 4: Acknowledgements – 2
VT Digital Library Research Laboratory
- Digital Libraries
- Information Retrieval
- Human-Computer Interaction
- Multimedia / Hypermedia / ePublishing
- Computing-related education
Slide 5: Selected DL Projects
- Digital Library Curricular Resources
  - NSF IIS-0535057 & 0535060
- CTRnet (Crisis, Tragedy & Recovery Net)
  - NSF IIS-0916733
- Ensemble (Computer Science Education)
  - NSF DUE-0840719
- Digital Preserve
  - NSF IIS-0910183 & 0910465
  - http://slurl.com/secondlife/Digital%20Preserve/140/126/295
Slide 6: Selected DL Projects - 2
- CINET: Network Science Middleware
  - NSF SDCI 1032677
  - Simulation, Cyberinfrastructure
  - Metadata-based Generation
- Establishing a Qatari Arabic-English DL Institute: NPRP 4 - 029 - 1 - 007 (pending)
- Fingerprint Analysis/Distortion/Training DLs
  - National Inst of Justice, BAE Systems
- ETD Analysis, Extraction, Classification
Slide 7: 2 duties of digital librarians
- Raise ability of patrons
  - Literacy -> facility -> computational thinking
  - University graduates -> general public
- Make content accessible now & forever
  - Discover, explore, search, browse (libraries)
  - Preserve (archives and museums)
- What to do as a digital librarian?
- How to prepare to be a digital librarian?
Slide 8: Locating Digital Libraries in Computing and Communications Technology Space
[Figure: axes are Computing (flops), Communications (bandwidth, connectivity), and Digital content, each running from less to more. Digital Libraries technology trajectory: intellectual access to globally distributed information.]
Note: we should consider 4 dimensions: computing, communications, content, and community (people)
Slide 9: Information Life Cycle
[Figure: cycle diagram set in a Social Context. Diagram labels: Creation, Searching, Utilization, Retention / Mining; Active, Semi-Active, Inactive; Authoring, Modifying, Organizing, Indexing, Storing, Retrieving, Distributing, Networking, Accessing, Filtering, Using, Creating.]
Slide 10: Informal 5S & DL Definitions
DLs are complex systems that
- help satisfy info needs of users (societies)
- provide info services (scenarios)
- organize info in usable ways (structures)
- present info in usable ways (spaces)
- communicate info with users (streams)
Slide 11: A Minimal DL in the 5S Framework
[Figure: the five Ss (Streams, Structures, Spaces, Scenarios, Societies) underlie the higher-level constructs. Diagram labels: Structured Stream, hypertext, Structural Metadata Specification, Descriptive Metadata Specification, Digital Object, Collection, Metadata Catalog, Repository, services (indexing, browsing, searching), Minimal DL.]
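The constructs named on this slide can be illustrated with a toy model. The following is only a minimal sketch, not the formal 5S definitions: the class names `DigitalObject` and `MinimalDL`, the handles, and the sample records are all hypothetical, and streams are assumed to be plain text.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class DigitalObject:
    """A digital object: an identifier plus a stream (assumed plain text)."""
    handle: str
    stream: str


@dataclass
class MinimalDL:
    """Hypothetical toy model: a collection, a metadata catalog, and services."""
    collection: dict = field(default_factory=dict)  # handle -> DigitalObject
    catalog: dict = field(default_factory=dict)     # handle -> descriptive metadata
    index: dict = field(default_factory=dict)       # term -> set of handles

    def store(self, obj, metadata):
        """Repository role: keep the object and its catalog record, then index it."""
        self.collection[obj.handle] = obj
        self.catalog[obj.handle] = dict(metadata)
        for term in obj.stream.lower().split():
            self.index.setdefault(term, set()).add(obj.handle)

    def search(self, query):
        """Searching service: handles whose stream contains every query term."""
        terms = query.lower().split()
        if not terms:
            return []
        hits = set.intersection(*(self.index.get(t, set()) for t in terms))
        return sorted(hits)

    def browse(self, key):
        """Browsing service: group handles by one metadata field."""
        groups = {}
        for handle, record in sorted(self.catalog.items()):
            groups.setdefault(record.get(key, "unknown"), []).append(handle)
        return groups


dl = MinimalDL()
dl.store(DigitalObject("etd:1", "digital library curriculum modules"),
         {"title": "DL Curriculum", "type": "report"})
dl.store(DigitalObject("etd:2", "crisis archive digital library"),
         {"title": "CTRnet", "type": "collection"})

print(dl.search("digital library"))  # handles matching both terms
print(dl.browse("type"))             # handles grouped by the "type" field
```

The sketch keeps the catalog separate from the collection so that browsing works on metadata alone while searching works on the indexed streams, which loosely mirrors the split between the Metadata Catalog and the Collection in the figure.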
Slide 12
Slide 13: Quality Dimensions
Slide 14: 5SSuite for DL R&D
[Figure: workflow through Requirements (1), Analysis (2), Design (3), and Implementation (4). Diagram labels: 5S Meta Model, 5SGraph, DL Expert, DL Designer, 5SL DL Model, 5SLGen, 5SGen, Mapping Tool, component pool (ODLSearch, ODLBrowse, ODLRate, ODLReview, ...), Tailored DL Services; Practitioner, Researcher, Teacher.]
Slide 15: Digital Libraries --- Objectives
- World Lit.: 24hr / 7day / from desktop
- Integrated “super” information systems: 5S: Table of related areas and their coverage
- Ubiquitous, Higher Quality, Lower Cost
- Education, Knowledge Sharing, Discovery
- Disintermediation -> Collaboration
- Universities Reclaim Property
- Interactive Courseware, Student Works
- Scalable, Sustainable, Usable, Useful
Slide 16: DL Overview: Why of Global Interest?
- National projects can preserve antiquities and heritage: cultural, historical, linguistic, scholarly
- Knowledge and information are essential to economic and technological growth, education
- DL - a domain for international collaboration
  - wherein all can contribute and benefit
  - which leverages investment in networking
  - which provides useful content on Internet & WWW
  - which will tie nations and peoples together more strongly and through deeper understanding
Slide 17: Outline
- Acknowledgments
- Digital libraries, 5S
- LIKES, CTRnet
- Curricular efforts
- Book efforts
Slide 18: Living In the KnowlEdge Society (LIKES)
- North Carolina A & T
- Santa Clara University
- Villanova University
- Virginia Tech
NSF CPATH: CCF-0722259, 0722276, 0722289, and 0752865
Slide 19: LIKES Workshops
- SCU, 2007: Defining Problems & Applications of Knowledge Society
- NC A&T, 2008: Biology, Geography, Music, Physics, Statistics, University Studies
- VT, 2008: defining key terms related to knowledge society, identifying key computing concepts, mapping disciplinary needs with computing concepts
- Villanova, 2009: interdisciplinary connections, modules/tools/pedagogy/assessment
- VT, 2009: Curricular Guidelines Connecting Computing with Other Disciplines
- Durham, 2010: biology, chemistry, physics, computational science, business/social sciences
Slide 20: LIKES Vision
Build a community leading the way to change how computing concepts are taught in both computing-related disciplines and the disciplines of the broader workforce and society.
Slide 21: Overall LIKES Objectives
Students should have the ability to apply
- Computing concepts
- Methods
- Computational thinking skills
to the needs of the emerging knowledge society, in modern times and in the future
Slide 22: Computing Concepts of Broad Interest (to other disciplines)
- data, information, and knowledge
- algorithms, analysis, problem solving, programming, work flows, and software engineering
- interaction, interfaces, graphics, games, visualization, and virtual environments
- modeling and simulation
Slide 23: Disciplines
[Figure: Knowledge Society at the center. Computing areas: HCI, Visualization, Knowledge Management, Systems Analysis & Design, Programming, Database, Algorithms, Architecture, Net-Centricity, Intelligent Systems, Social & Ethical. Disciplines: Library / Information Science, Sociology, Simulation, Communications, Political Science, Architecture, Healthcare, Economics, Finance, Psychology, Marketing, Physics, Music, Engineering, History, Biology, Art, Chemistry, Geography, Math, Geology, English.]
Slide 24: Applications
[Figure: Knowledge Society at the center. Computing areas: HCI, Visualization, Knowledge Management, Systems Analysis & Design, Programming, Database, Algorithms, Architecture, Net-Centricity, Intelligent Systems, Social & Ethical. Applications: Library Information Science, GIS, Simulation, Online Shopping, Multi Media, Semantic Web, CSCW, Digital Government, Healthcare Services.]
Slide 25: Crisis, Tragedy, and Recovery
- Human tragedies that result from man-made and natural events affect humans and communities significantly.
- During and after a tragic event, there are a series of needs that have to be addressed.
- Compounded by communication failures and a confusing plethora of data and information
Slide 26
- Build a networked digital library relating to CTR
- Support information exploration
  - Aided by an ontology
- Integrate community, content, and services relating to CTR, making it accessible, and preserving it for long-term reuse
www.citeulike.org (group: ctrnet): Citations, Papers, …
www.ctrnet.net
Slide 27: Goals for Ontology for CTR
[Figure: CTR Ontology at the center, covering Individual, Organizational, Community, Political, … Sources: Social network applications, CTR literature, Focus groups, Websites, Internet Archive, Multicultural/linguistic input. Uses: Browsing, Searching, Query expansion, Visualizing, Tagging, Summarizing, Recommending.]
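Query expansion, one of the ontology uses listed on this slide, can be sketched in a few lines. This is an illustrative assumption only: the `CTR_ONTOLOGY` entries and the `expand_query` helper are hypothetical and are not taken from the actual CTRnet ontology.

```python
# Hypothetical toy ontology: each concept maps to related terms.
CTR_ONTOLOGY = {
    "earthquake": ["seismic event", "tremor"],
    "shooting": ["gun violence"],
    "recovery": ["rebuilding", "relief"],
}


def expand_query(query, ontology):
    """Return the query terms plus related ontology terms, without duplicates."""
    expanded = []
    for term in query.lower().split():
        # Keep the original term first, then any related terms from the ontology.
        for candidate in [term, *ontology.get(term, [])]:
            if candidate not in expanded:
                expanded.append(candidate)
    return expanded


print(expand_query("earthquake recovery", CTR_ONTOLOGY))
```

Terms that the ontology does not know pass through unchanged, so expansion can only widen a search, never drop what the user typed.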
Slide 28: Generalizing CTRnet
- Precursor: www.dl-vt-416.org
- Sequel: Event Archiving
  - Related to work of national libraries to archive internet of interest (as in plans for Qatar Digital Library Institute)
  - Related to International Internet Preservation Consortium (IIPC, www.netpreserve.org)
  - Related to Archive-it (www.archive-it.org) and its Spontaneous Events collections (see http://www.archive-it.org/public/topic.html?topic=spontaneousEvents)
Slide 29: Outline
- Acknowledgments
- Digital libraries, 5S
- LIKES, CTRnet
- Curricular efforts
- Book efforts
Slide 30: CS2008 Information Management Areas beside DL#
- Information models*
- Transaction processing
- Database systems*
- Distributed DBs
- Data modeling*
- Physical DB design
- Indexing
- Data mining
- Relational DBs
- Information storage and retrieval#
- Query languages
- Hypermedia
- Relational DB design
- Multimedia systems
* Core components
# DL moved to ISR for 2013
Slide 31: DL Curriculum Framework
Slide 32: DL Curric. Project - 1
- NSF awards to VT and UNC-CH
- CS and LIS
- Project server: http://curric.dlib.vt.edu/
- Wikiversity: http://en.wikiversity.org/wiki/Curriculum_on_Digital_Libraries
Slide 33: DL Curric. Project - 2
- Module 1-a (10-c): Conceptual frameworks, models, theories, definitions
- Module 1-b: History of digital libraries and library automation
- Module 2-c (8-d): File Formats, Transformation, and Migration
- Module 3-b: Digitization
- Module 3-e (7-e): Web publishing
- Module 4-b: Metadata
- Module 5-a: Architecture overviews
Slide 34: DL Curric. Project - 3
- Module 5-b: Application software
- Module 5-d: Protocols
- Module 6-a: Information needs/relevance
- Module 6-b: Online information seeking behaviors and search strategies
- Module 6-d: Interaction design and usability assessment
- Module 7-a: Indexing and searching
- Module 7-a(1): Image retrieval
Slide 35: DL Curric. Project - 4
- Module 7-b: Reference Services
- Module 7-c: Recommender systems
- Module 7-d: Routing
- Module 7-f: Crawling
- Module 7-g: Personalization
- Module 8-a: Preservation
- Module 8-b: Web archiving
- Module 9-c: Digital library evaluation, user studies
Slide 36: DL Curric. Project – 5: Modules and Cloud Instances
[Figure: cloud instances labeled by module area. IR: Apache Solr, Lemur, WordNet, R, NLTK, SEDNA XML DB, CLUTO, Weka, TREC Eval, Hadoop Map-Reduce. MM: Media Computation, Audacity, PureData. Fingerprint.]
Slide 37: Module Development – What?
- Digital Libraries
- Information Retrieval tools (cloud)
- Multimedia tools (cloud)
- Biometrics Training
  - Especially fingerprint analysis
Slide 38: Module Development – Who?
- Experts
  - DL
  - Biometrics
- Teams in a 6000-level DL Course: 4
- Teams in a 5000-level IR Course: 5 (+5)
- Teams in a 4000 MM Course: 4
Slide 39: Outline
- Acknowledgments
- Digital libraries, 5S
- LIKES, CTRnet
- Curricular efforts
- Book efforts
Slide 40: Old Planned Book: Parts
- Ch. 1. Introduction (Motivation, Synopsis)
- Part 1: The “Ss”
- Part 2: Higher DL Constructs
- Part 3: Advanced Topics
- Appendix
Slide 41: Old Planned Book: Part 2
Part 2: Higher DL Constructs
- Ch. 7: Collections
- Ch. 8: Catalogs
- Ch. 9: Repositories and Archives
- Ch. 10: Services
- Ch. 11: Systems
- Ch. 12: Case Studies
Slide 42: Old Planned Book: Part 3 …
Part 3: Advanced Topics
- Ch. 13: Quality
- Ch. 14: Integration
- Ch. 15: How to build a digital library
- Ch. 16: Research Challenges, Future Perspectives
Appendix
- A: Mathematical preliminaries
- B: Formal Definitions: Ss
- C: Formal Definitions: DL terms, Minimal DL
- D: Formal Definitions: Archeological DL
- E: Glossary of terms, mappings
Slide 43: Book(s) for 2012
- Morgan-Claypool lecture manuscript for Synthesis digital library series (planned)
- Book for CS6604, Digital Libraries, Fall 2011
  - See highlights in next slides
  - Being considered by publishers for 2012 release
  - Accompanied by slides and other supplemental materials
  - Available for test use for classes in January
Slide 44: Book Draft Chapter Authors
Monika Akbar, Pranav Angara, Yinlin Chen, Lois M. Delcambre, Noha Elsherbiny, Eric Fouh, Marcos André Gonçalves, Nádia P. Kozievitch, Spencer Lee, Jonathan Leidig, Lin Tzy Li, Mohamed Magdy Gharib Farag, Uma Murthy, Sung Hee Park, Rao Shen, Venkat Srinivasan, Ricardo da Silva Torres, and Seungwon Yang
Slide 45: Book Draft - 1
- Frontmatter
- Part 1: Basic Concepts
  - Ch. 1: Introduction
  - Ch. 2: Exploration
  - Ch. 3: Evaluation
- Part 2: Advanced Concepts
  - Ch. 4: Complex objects
  - Ch. 5: Integration
  - Ch. 6: Subdocuments
  - Ch. 7: Ontologies
  - Ch. 8: Classification
Slide 46: Book Draft - 2
- Part 3: Applications
  - Ch. 9: Content-based Image Retrieval (CBIR)
  - Ch. 10: Online Communities and Social Networks
  - Ch. 11: Education
  - Ch. 12: Bioinformatics, Scientific, and Simulation DLs
  - Ch. 13: Geospatial Information
  - Ch. 14: Security
  - Ch. 15: Text Extraction
- Backmatter: References, Mathematical Preliminaries, Glossary
Slide 47
Slide 48: Summary
- Acknowledgments
- Digital libraries, 5S
- LIKES, CTRnet
- Curricular efforts
- Book efforts
Slide 49
Questions?
Discussion?
Thank You!
(fox@vt.edu)