14 episodes

Tools for organizing and accessing information have become indispensable. It is critical, therefore, to understand their design and operational foundations. In this course students will have an opportunity to learn about search engines, web crawling, and some Web 2.0 technologies based on hands-on experience and with a focus on techniques that can be used to access, retrieve, organize, and present information. Students will work with practical developmental tools and learn relevant concepts through experimentation. For instance, students will employ an open source search engine and learn about indexing, retrieving, and ranking techniques.

INLS490-154W: Information Retrieval Systems Design and Implementation Chirag Shah

    • Education
    • 5.0 • 2 Ratings

    • video
    Information organization

    This is the podcast for the thirteenth class, in which we look at a couple of ways to organize and present information to the user. We see how a term-cloud interface can be created, allowing the user to get a quick glance at the underlying collection. We also talk about a number of clustering algorithms and see how they can be implemented with Lemur.

    • 49 min
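A term cloud boils down to counting terms across a collection and mapping the counts to display sizes. The following is a minimal sketch of that idea in plain Python, not the Lemur-based approach used in the class; the stopword list, the function name `term_weights`, and the font-size range are all assumptions made for illustration.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list for illustration only.
STOPWORDS = {"the", "a", "an", "of", "and", "to", "in", "is", "for", "on"}

def term_weights(documents, top_n=5, min_size=10, max_size=40):
    """Return {term: font_size} for the top_n most frequent terms."""
    counts = Counter()
    for doc in documents:
        # Lowercase and keep alphabetic tokens only.
        for token in re.findall(r"[a-z]+", doc.lower()):
            if token not in STOPWORDS:
                counts[token] += 1
    top = counts.most_common(top_n)
    if not top:
        return {}
    highest = top[0][1]
    lowest = top[-1][1]
    span = max(highest - lowest, 1)  # avoid division by zero
    # Linearly scale each count into the [min_size, max_size] range.
    return {
        term: min_size + (count - lowest) * (max_size - min_size) // span
        for term, count in top
    }

docs = [
    "search engines index documents",
    "search engines rank documents",
    "search is fun",
]
sizes = term_weights(docs, top_n=3)
```

The resulting sizes could then be fed to whatever front-end renders the cloud, for example as inline font sizes in HTML.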
    • video
    IR on Web 2.0

    This is the podcast for the twelfth class, in which we see how REST requests can be made over the web and how the XML responses can be parsed. This allows us to start connecting with Web 2.0 sources, which make it possible to combine different sources through open data exchange.

    • 46 min
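The request-and-parse cycle described above can be sketched with Python's standard library. This is an illustrative example under stated assumptions: the endpoint `http://example.org/api/search`, the parameter names, and the `<item>`/`<title>`/`<link>` response layout are hypothetical and do not correspond to any particular Web 2.0 service. The sample response is embedded so the sketch runs without network access; in practice the XML text would come from something like `urllib.request.urlopen(url).read()`.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

def build_request_url(base_url, params):
    """Compose a REST-style GET URL from a base URL and query parameters."""
    return base_url + "?" + urlencode(params)

def parse_results(xml_text):
    """Extract (title, link) pairs from every <item> element in the response."""
    root = ET.fromstring(xml_text)
    return [
        (item.findtext("title", default=""), item.findtext("link", default=""))
        for item in root.iter("item")
    ]

# Hypothetical response body, standing in for what a service might return.
SAMPLE_RESPONSE = """<response>
  <item><title>Lemur Project</title><link>http://example.org/lemur</link></item>
  <item><title>Indri</title><link>http://example.org/indri</link></item>
</response>"""

url = build_request_url(
    "http://example.org/api/search",
    {"q": "information retrieval", "format": "xml"},
)
results = parse_results(SAMPLE_RESPONSE)
```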
    • video
    Web crawling

    This is the podcast for the eleventh class, in which we see traditional and non-traditional methods of collecting data from the web. The traditional approach is demonstrated by crawling the web with wget, and the non-traditional approach is illustrated with YouTube harvesting.

    • 58 min
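At the heart of any crawler is a step that pulls links out of a fetched page so they can be queued for later visits. The sketch below shows that one step with Python's standard library; it is not how wget works internally, and the class and function names are made up for this example. Fetching, politeness delays, and robots.txt handling are intentionally left out.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

class LinkExtractor(HTMLParser):
    """Collect absolute URLs from the href attributes of <a> tags."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    # Resolve relative links against the page's own URL.
                    self.links.append(urljoin(self.base_url, value))

def extract_links(html, base_url):
    """Return the list of absolute link targets found in an HTML string."""
    parser = LinkExtractor(base_url)
    parser.feed(html)
    return parser.links

page = '<a href="/a.html">A</a> <a href="http://other.example/b">B</a>'
links = extract_links(page, "http://example.org/index.html")
```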
    • video
    User interface for search

    This is the podcast for the tenth class, in which we connect the back-end for search that we have been working with to a web-based front-end. This is done using Indri, a new search engine component for Lemur. We also explore some details of AJAX and see how we could use it to enhance our user interface for search.

    • 1 hr 25 min
    • video
    Evaluation-2

    This is the podcast for the ninth class, in which we continue looking at evaluation. We talk about more measures for evaluating a query and a system. We also look at comparing two ranked lists.

    • 51 min
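One common way to compare two ranked lists is Kendall's tau, which counts how many pairs of items appear in the same relative order in both lists. The listing does not say which comparison method the class covers, so the sketch below is only one possible choice; it assumes neither list contains duplicates and considers only the items the two lists share.

```python
from itertools import combinations

def kendall_tau(ranking_a, ranking_b):
    """Kendall's tau over the items shared by two ranked lists (no ties)."""
    position_b = {item: i for i, item in enumerate(ranking_b)}
    shared = [item for item in ranking_a if item in position_b]
    concordant = discordant = 0
    # combinations() yields pairs (x, y) where x precedes y in ranking_a.
    for x, y in combinations(shared, 2):
        if position_b[x] < position_b[y]:
            concordant += 1
        else:
            discordant += 1
    total = concordant + discordant
    if total == 0:
        return 0.0  # fewer than two shared items: nothing to compare
    return (concordant - discordant) / total

same = kendall_tau(["d1", "d2", "d3"], ["d1", "d2", "d3"])
reverse = kendall_tau(["d1", "d2", "d3"], ["d3", "d2", "d1"])
swapped = kendall_tau(["d1", "d2", "d3"], ["d2", "d1", "d3"])
```

A value of 1 means the two lists agree on every pair, -1 means one is the exact reverse of the other, and values in between indicate partial agreement.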
    • video
    Evaluation-1

    This is the podcast for the eighth class, in which we start looking at one of the core components of IR: evaluation. We begin our discussion by revisiting recall and precision, and then continue exploring R-precision, average precision (AP), and mean average precision (MAP). We see how these can be measured manually and then using TREC-supplied tools.

    • 41 min
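The measures named above can be computed by hand from a ranked result list and a set of relevance judgments. The following sketch uses the standard textbook definitions; it is not the TREC tooling mentioned in the episode, and the document IDs and function names are invented for illustration.

```python
def precision_recall(ranked, relevant):
    """Set-based precision and recall for a retrieved list."""
    hits = sum(1 for doc in ranked if doc in relevant)
    precision = hits / len(ranked) if ranked else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall

def r_precision(ranked, relevant):
    """Precision at rank R, where R is the number of relevant documents."""
    r = len(relevant)
    if r == 0:
        return 0.0
    return sum(1 for doc in ranked[:r] if doc in relevant) / r

def average_precision(ranked, relevant):
    """Mean of the precision values at each rank where a relevant doc appears."""
    if not relevant:
        return 0.0
    hits = 0
    total = 0.0
    for rank, doc in enumerate(ranked, start=1):
        if doc in relevant:
            hits += 1
            total += hits / rank
    # Dividing by all relevant docs penalizes relevant docs never retrieved.
    return total / len(relevant)

def mean_average_precision(runs):
    """Average of AP over a list of (ranked, relevant) pairs, one per query."""
    if not runs:
        return 0.0
    return sum(average_precision(r, rel) for r, rel in runs) / len(runs)

ranked = ["d1", "d2", "d3", "d4"]
relevant = {"d1", "d3"}
p, r = precision_recall(ranked, relevant)
rp = r_precision(ranked, relevant)
ap = average_precision(ranked, relevant)
map_score = mean_average_precision([(ranked, relevant), (["d5", "d6"], {"d6"})])
```

For the first query, relevant documents appear at ranks 1 and 3, so AP is (1/1 + 2/3) / 2; MAP then averages that with the second query's AP of 1/2.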
