Information Science 220

Information Storage and Retrieval

Fall 2013


Instructor: David Vampola

Office: 424 Shineman Center

Email: vampola@cs. oswego. edu

Telephone: (315) 312-2689 (Note that I do not have voicemail)

Office Hours: Tuesday, Thursday 10:00 - 11:00 and Friday: 8:00 - 9:00 and by appointment

Website: www.cs.oswego.edu/~vampola


Course Description/Objectives:

Consideration of the basic principles and tools for analysis and retrieval of information in various information systems (both textual and database systems). Topics include analysis and storage of information, retrieval concepts and types of retrieval systems.

Texts:

Manning, Christopher et al. Introduction to Information Retrieval (Cambrdge: Cambridge Unversity Press, 2008)

PLEASE NOTE:There will be additional materials assigned in the course.


Evaluation:


Midterm Examination 20%
Final Examination 30%
Quizzes 5%
Course Assignments 20%
Final Project 25%


Personal Responsibility:

It is expected that the student will assume responsibility for his/her performance in the course. Hence, it is incumbent on the student to bring any problems that he/she might be having in completing the required coursework to the attention of the instruction as soon as possible.


Students with Disabilities:

Those students who need special consideration for whatever reason should notify the Office of Disability Services at the beginning of the semester.


Course Outline


Basic Text Storage and Retrieval


I. Boolean Seaching

Reading: Manning, Chapter 1, Regular expressions and grep readings


Text with Markup/Metadata


2. Vocabulary and Indexing

Reading: Manning, Chapters 2 - 5


3. Scoring and term weighting

Reading: Manning, Chapters 6-7


4. Evaluation and relevance

Reading: Manning, Chapters 8-9


5. Probabilistic, Bayesian and Vector Approaches

Reading: SKIM Manning, Chapters 11-12


6. Topics in Web Information Retrieval

Reading: Manning, Chapter 10; Chapters 19-21; Also hand-outs on HTML and XML


Databases


Reading: Web resources on Databases


Networked Information: Web Documents


7. Information Architecture

Reading: Web resources on Information Architecture



Networked Information: Social Networks

8 Retrieving information from the Social Web

Reading: Reading from “Social Networks for Startups” (On reserve)