bandishDB is a database of symbolic scores of North Indian Classical vocal compositions (bandishes) in **bhat notation. It comprises of over 135 compositions in more than 6 raags with >13000 notes. The dataset is still being built and will be updated with many more raags.
The symbolic music notations for bandishes were encoded using the books:
- Vishnu Narayan Bhatkhande, Hindustani Sangeet Paddhati : Kramik Pustak Maalika Vol. I-VI, Sangeet Karyalaya, 1990.
- Ram Ashray Jha, Abhinav Geetanjali Vol. I-V, Sangeet Sadan Prakashan, 2001.
Please reference the following publication when using this database:
P. Chordia. A system for the analysis and representation of bandishes and gats using humdrum syntax. In Frontiers of Research in Speech and Music Conference,
The up-to-date dataset can be obtained here as a zipped archive: bandishDB
JNMR Tabla DB
The JNMR tabla database is comprised of 35 compositions containing a total of 27,000 strokes. The symbolic notation (in a Humdrum format called **bol) and corresponding audio are given. For further information regarding the encoding scheme please see: Parag Chordia. "Automatic transcription and representation of solo tabla music." Computing in Musicology. Vol. 13. The audio was synthesized using professionally recorded samples. For each stroke category, there are approximately ten samples, which are chosen randomly to simulate the natural variance in timbre between strokes of the same category. Please reference the following publication when using this database:
P. Chordia, A. Sastry, and S. Senturk. Predictive tabla modeling using variable-length markov and hidden markov models. Journal of New Music Research, 40(2):105-118, 2011.
idm09 (tempo db)
The idm09 used in our ISMIR 2009 paper consists of tenty-seven 30 second excerpts chosen from the IDM/glitch genre of electronic music, with an emphasis on tracks that we thought were rhythmically complex. For each excerpt, two independent manual annotations were made.
A compressed, zipped tarball of all the mp3 files: idm09.tar.gz
Tempo ground truth (textfile): idm09ground.txt
The excerpts can be previewed individually here
We hope to make this database (feature matrices and recordings) available for download soon.
This database was initially assembled for a project in applying content-based recommendation techniques to Indian Classical music. For this reason its content is quite wide ranging, in terms of sub-genre, era, recording quality, performer, instrument, etc.
- Commercial and non-commercial recordings, many live concerts
- Early 20th century to present
- Wide range of recording quality
- 897 tracks
- 40 hours
- 141 artists
- 14 instruments
- 171 raags (71 raags had only one recording)
This database was assembled primarily from recordings that I made in Kolkata in 2006. Most of the recordings contain only the solo instrument or voice. The recordings were made with accompanying instruments, mainly tanpura, and tabla for metric sections, but the solo instrument was extracted from the mutli-tracked sessions. These recordings are excellent for melody classification and modeling tasks. An accompanying text file gives the artist, raag, instrument, and tonic frequency ("Sa").
- 19 musicians (6 instrumentalists, 13 vocalists)
- 3 - 60 minute long recordings
- wide range of recording quality
- wide range of accompaniment (includes completely unaccompanied studio recordings)
- over 20 hours of material
Raags included (31):