DominikMe/multimedia-textsearch

Latest commit: c9734a0 by Dominik Messinger, Jul 14, 2013 (43 commits)

multimedia-textsearch

Authors: Dominik Messinger, Alexander Weigl and Ge Wu
License: gpl-v3

Description

University project for the lecture Text Indexing. The idea is to search audio and video files for keywords by building an inverted index beforehand.

We introduce the concept of timed documents. A timed document contains the document's text sliced into blocks, each with time information. These documents are produced by preprocessing audio and video files and can be stored in an XML format. The inverted index is built from these timed documents.
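
The relationship between timed documents and the inverted index can be sketched as follows. This is only a minimal illustration under stated assumptions, not the project's actual code: the class names `TimedIndexSketch`, `Block`, `TimedDocument` and `Posting`, the use of seconds for time, and the simple lowercase tokenization are all hypothetical.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;

// Hypothetical sketch; none of these names are taken from the repository.
class TimedIndexSketch {

    /** One slice of text plus the time span (assumed to be in seconds) it covers. */
    static final class Block {
        final double start;
        final double end;
        final String text;
        Block(double start, double end, String text) {
            this.start = start;
            this.end = end;
            this.text = text;
        }
    }

    /** A timed document: the source media file and its ordered text blocks. */
    static final class TimedDocument {
        final String source;
        final List<Block> blocks;
        TimedDocument(String source, List<Block> blocks) {
            this.source = source;
            this.blocks = blocks;
        }
    }

    /** One index entry: the media file and the start time of the matching block. */
    static final class Posting {
        final String source;
        final double start;
        Posting(String source, double start) {
            this.source = source;
            this.start = start;
        }
    }

    /** Builds a term -> postings map; each block contributes at most one posting per term. */
    static Map<String, List<Posting>> buildIndex(List<TimedDocument> docs) {
        Map<String, List<Posting>> index = new TreeMap<>();
        for (TimedDocument doc : docs) {
            for (Block block : doc.blocks) {
                // Assumed tokenization: lowercase, split on anything that is not a letter or digit.
                Set<String> terms = new LinkedHashSet<>();
                for (String token : block.text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+")) {
                    if (!token.isEmpty()) {
                        terms.add(token);
                    }
                }
                for (String term : terms) {
                    index.computeIfAbsent(term, k -> new ArrayList<>())
                         .add(new Posting(doc.source, block.start));
                }
            }
        }
        return index;
    }

    /** Looks up a keyword and returns the matching postings (empty if the term is unknown). */
    static List<Posting> search(Map<String, List<Posting>> index, String keyword) {
        List<Posting> hits = index.get(keyword.toLowerCase(Locale.ROOT));
        return hits == null ? Collections.<Posting>emptyList() : hits;
    }

    public static void main(String[] args) {
        List<Block> blocks = new ArrayList<>();
        blocks.add(new Block(0.0, 5.0, "Inverted index, inverted lists"));
        blocks.add(new Block(5.0, 9.5, "Suffix arrays and the index"));
        TimedDocument doc = new TimedDocument("lecture.mp4", blocks);

        Map<String, List<Posting>> index = buildIndex(Collections.singletonList(doc));
        for (Posting p : search(index, "index")) {
            System.out.println(p.source + " @ " + p.start + "s");
        }
    }
}
```

Because each posting carries a start time, a keyword hit can be mapped back to a position in the audio or video file rather than only to the file as a whole.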

Dependencies

Java Dependencies:

* Apache Commons IO
* Apache Commons Lang
* JavaTuples
* jdom
* json-simple

External Dependencies:

* a working tesseract installation (win32 binaries are included)
* ffmpeg (win32 binaries are included)
