1 / 17

Special Indices for LaaLaLaa Lyric Analysis & Generation Framework

Special Indices for LaaLaLaa Lyric Analysis & Generation Framework. Dr. V. Madhan Karky, Tamil Computing Lab ( TaCoLa ), College of Engineering Guindy , Anna University, Chennai. Overview. Objective Introduction Background Rhyme Schemes in Tamil Meter Pattern System Architecture

sanne
Download Presentation

Special Indices for LaaLaLaa Lyric Analysis & Generation Framework

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Special Indices for LaaLaLaa Lyric Analysis & Generation Framework Dr. V. Madhan Karky, Tamil Computing Lab (TaCoLa), College of Engineering Guindy, Anna University, Chennai.

  2. Overview • Objective • Introduction • Background • Rhyme Schemes in Tamil • Meter Pattern • System Architecture • Indexing Structure • Indexing Algorithm • Results and Analysis

  3. Objective • To build special indices for the LaaLaLaa lyric analysis and generation framework to facilitate faster retrieval based on • Meter Pattern • Rhyme

  4. Introduction • Tamil is a vibrant language with a rich grammar, vocabulary, an inherent poetic flavor. • About 1000 lyrics are being written every year as private albums, jingles and as original soundtracks of mainstream movies.

  5. Background • WASP (Pablo Grevas (2000)) splits a given block of text, identifies patterns and fits words from the vocabulary to get verses of similar pattern. • COLIBRI (Agudo, Grevas, Calero (2002)) follows a case based approach to poem generation.

  6. Background • Tra-la-Lyrics (Oliviera, Cardoso, Pereira (2005)) finds out the beat pattern of the midi file and places words with similar syllabic division and stress pattern. • Automatic generation of tamil lyrics for melodies (Ramakrishnan, Kuppan, Devi (2009)) converts the midi file to KNM reprsn and fits words to it from a corpus. Phrases are meaningful as parts only. No edhugai, monai or iyaybu.

  7. Background • LaaLaLaa (Sowmiya, Karky (2010)) talks of splitting raw text from midi file to templates and filling them with words from a wordnet according to the pattern mined from an existing corpus of lyrics with due consideration to rhyme, meaning and flow.

  8. Rhyme Schemes in Tamil • Two words are said to rhyme in • Monai (மோனை) - first letters are the same. • Edhugai (எதுகை) - second letters are the same. • Iyaibu (இயைபு) - last letters are the same.

  9. Rhyme Schemes in Tamil • Examples: • பறவைand பச்சை rhyme in monai. • அருவி and விருப்பு rhyme in edhugai. • யாக்கை and வாழ்க்கை rhyme in iyaibu. • அருவி and குருவி rhyme in edhugai and iyaibu. • கவிதைகள் and கவிஞர்கள் rhyme in all the three schemes.

  10. Meter Pattern • Maathirai (மாத்திரை) - time taken to wink an eyelid. • Maathirai based classification of Tamil alphabets. • Nedil (N) (நெடில்) - Those alphabets which are pronounced for the time interval of 2 maathirai. • Kuril (K) (குறில்) - Alphabets which take 1 maathirai to be pronounced. • Mei (M) (மெய்) - Alphabets which are pronounced for 0.5 maathirai. • Meter pattern of a word refers to its Kuril Nedil Mei pattern. • For example, the Meter pattern of the word பாடல் is NKM as பா is a Nedil(N), ட is a Kuril(K) and ல் is a Mei(M).

  11. System Architecture Lyric DB Word Object Convertor Rhyme Extractor Rhyme Pattern Extractor Index Builder Rhyme Meter Index

  12. Indexing Structure

  13. Indexing Algorithm

  14. Result and Analysis Retrieval Time(in milliseconds) Word

  15. Results and Analysis • Retrieval complexity of both the approaches tested using a dataset of 500 tamil words. • The average retrieval time in • Word indexed approach - 875.47millisecond • Meter Rhyme Indexed approach – 1.90millisecond

  16. Results and Analysis • The drastic decrease in retrieval time from O(α) to O(1) [α is the number of words in the database] is due to • The use of hash-tables which are efficient for retrieval. • Having separate hash-tables for the மோனை, எதுகைand இயைபு of each POS.

  17. Thank You!!!

More Related