Skip to content

Latest commit

 

History

History
44 lines (26 loc) · 3.73 KB

README.md

File metadata and controls

44 lines (26 loc) · 3.73 KB

Fall 2020 CSCI 576 Multimedia Project

Instructor: Parag Havaldar

Project description

Text based searching today has become a natural part of how we access information and there are many ways to search and rank textual information. For instance, you can search using a specific text string query while browsing a big text document and just as easily bring up other documents that contain the same query text string. Search engines like google, bing, yahoo etc. enable you to search the whole world wide web of textural web pages for specific strings and rank them in an order of importance using various classes of search algorithms.

Now with the advances in inexpensive digital video (and audio) capture devices, media data is commonplace now. There is a lot of digital video information everywhere – streamed to you via the internet and cable networks, hosted on websites and social media networks, your own personal hard disks etc. With a lot of video/audio/image information, there needs to be a search paradigm to make sense of it. However, the search paradigms and methodologies for media data are as well formed and are still related to text information and/or metadata that is annotated around the media data.

Rather than using text to query media, another natural paradigm might be to use media itself – for instance given short video clip, you want to search through an entire database of videos to find the video which contains the “same” query or “similar” content as in the query clip. The motivating question here is - what is involved in developing a system that takes a short video clip as input and very quickly produces a ranked list of videos (from a database) that either contains the queried short clip or contains similar queried short clips. This is no doubt a complex problem with many nuances to consider but is also a practical and useful problem that needs a solution.

This project aims to provide a decent solution to this hard problem.

Datasets

We will be using these following datasets:

  1. Data_jpg
  2. Data_wav

Team members

Haixiang Liu is a great student.

Yaqi Shao is a Computer Science student at USC and also a student worker at the Signal and Image Processing department at EES.

Juntao Shen is a second year graduate student in the Computer Science Data Science program. He obtained his bachelor’s degree in computer science at University of Southern California.

Milestones and timelines

Deadlines (Week of) Tasks
11/16~11/21 Team members share research findings.
11/23~11/27 Team members decide approaches and implement code to tests datasets.
11/28~11/30 Team members test out each approaches and gives a hybrid method.
11/30~12/03 Testing phases
12/04 Demo Current