Re: [ANN] Audio Waveform display

Mark Smith Sun, 15 Jan 2006 05:56:31 -0800

Sivakatirswami, I think the audio to MIDI functionality exists in a few of the pro sequencing packages, but you might take a look at Melodyne: http://www.celemony.com/

I don't know of anything that can extract a single voice from a recording of a full band/orchestra/choir. Perhaps it can be done for a clarinet or oboe (I haven't seen this, myself) since those instruments produce a relatively simple sound (not so many complex harmonics) unless blown very hard. A human voice is a very much more complex sound, and obviously much more variable, which might explain the difficulty.

Getting the pitches of a solo voice recording can certainly be done, and googling 'audio to midi' produces a lot of results. I suspect this is one of those areas like Photoshop-type image transformations, where it could probably be done in Transcript, but would likely be so slow as to be not worth the effort in practice. For a start, you'd need to read every sample point of the binary audio file, rather than take every n samples as my waveform display does, and so just reading the data would take a long time. I suppose you could do it in chunks, which would probably be more efficient, but then you've still got to analyse the frequency of the peaks and troughs (I imagine this is how it would be done), and I just can't see this being done in Transcript speedily enough for the amounts of data involved. Maybe someone will prove me wrong though, as I proved myself wrong about displaying the waveform!
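To make the chunked idea concrete, here is a minimal sketch in Python rather than Transcript. It assumes the audio has already been decoded into a list of signed sample values, and it estimates pitch by timing upward zero crossings, which is only one simple stand-in for "analysing the peaks and troughs". The function names and the chunk size are illustrative assumptions, and real vocal recordings would need something more robust (autocorrelation, for example).

```python
import math

def estimate_pitch_hz(samples, sample_rate):
    """Estimate the frequency of one chunk from its upward zero crossings.

    Returns None when the chunk has fewer than two crossings (e.g. silence).
    """
    crossings = []
    for i in range(1, len(samples)):
        prev, cur = samples[i - 1], samples[i]
        if prev < 0 <= cur:
            # Linear interpolation gives a sub-sample crossing position.
            frac = -prev / (cur - prev)
            crossings.append(i - 1 + frac)
    if len(crossings) < 2:
        return None
    periods = len(crossings) - 1
    span_in_samples = crossings[-1] - crossings[0]
    return sample_rate * periods / span_in_samples

def pitches_by_chunk(samples, sample_rate, chunk_size=2048):
    """Walk the recording in fixed-size chunks; one estimate per chunk."""
    return [
        estimate_pitch_hz(samples[start:start + chunk_size], sample_rate)
        for start in range(0, len(samples) - chunk_size + 1, chunk_size)
    ]

def sine(freq_hz, sample_rate, count):
    """Test signal: a pure tone, standing in for a decoded audio file."""
    return [math.sin(2 * math.pi * freq_hz * i / sample_rate)
            for i in range(count)]

if __name__ == "__main__":
    tone = sine(440.0, 44100, 8192)
    print(pitches_by_chunk(tone, 44100))
```

Every sample is still visited once, so this does not remove the cost of reading the data; it only keeps the amount held and analysed at any moment small.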

For Indian music, you'd have to be more fine-grained than the Western chromatic scale (semitones), as Indian music has its own system that uses quarter tones (at least - I'm not very knowledgeable about this), but you could just maintain a small database of different scales - Indian, Western, Malaysian etc.
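As a rough illustration of the "small database of scales" idea, the Python sketch below stores each scale as a number of equal divisions of the octave plus the allowed degrees, then snaps a measured frequency to the closest allowed pitch. The scale names, the 24-division "quarter tone" grid, and the function name are assumptions made for the example; an equal 24-step grid is a simplification and not a faithful model of any Indian tuning system.

```python
import math

# Hypothetical scale table: name -> (divisions per octave, allowed degrees).
SCALES = {
    "western_major": (12, [0, 2, 4, 5, 7, 9, 11]),
    "quarter_tone_grid": (24, list(range(24))),
}

def snap_to_scale(freq_hz, scale_name, tonic_hz):
    """Return the frequency of the scale degree closest to freq_hz."""
    divisions, degrees = SCALES[scale_name]
    # Position of the measured pitch, in scale steps above the tonic.
    exact = divisions * math.log2(freq_hz / tonic_hz)
    octave = math.floor(exact / divisions)
    within_octave = exact - octave * divisions
    # Include the tonic of the next octave so pitches near the top wrap up.
    candidates = degrees + [divisions]
    best = min(candidates, key=lambda degree: abs(degree - within_octave))
    return tonic_hz * 2 ** (octave + best / divisions)

if __name__ == "__main__":
    print(snap_to_scale(453.0, "western_major", 440.0))
    print(snap_to_scale(453.0, "quarter_tone_grid", 440.0))
```

The same measured pitch lands on different targets depending on the grid, which is the point of keeping the scales as data.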

Anyway, if you decide to try it, feel free to use any code from my display control (the file reading part might be a useful starting point), and let me know if you think I can help.



Best,

Mark

On 15 Jan 2006, at 04:40, Sivakatirswami wrote:

Fascinating... I wonder how far we can take this: "analyze sound" --> analyze song --> output notes. Here is a specific application I would be very interested in:
Take a vocal song and analyze the pitch-melody and output some musical notation. The idea is to "capture the tune." In this case we need to display shifts in hertz over time, and not just amplitude.
Of course this may be reinventing the wheel, but a search on the web doesn't turn up much other than MIDI to notation and some very obscure cmd line tools from the world of European polyphonic music. Maybe some of our other music wizards will chime in here. I had a tool for this years ago but they went out of business. I thought Finale had a plug-in for it, but I don't see it and this is the premier notation program...
Even if there is something out there... a rev app would be nice:
One would have to set up a range-distance in hertz for pitch changes that would be equivalent to a half step on the 12 note octave. 73.333 hertz per step, I think... The pitch wave form would set a marker every time the pitch changes by that much.... Now you could use known values (440 = A with a tolerance of 2, 438-442 = the note A).
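A short Python sketch of the hertz-to-note mapping, assuming 12-tone equal temperament with A = 440 Hz. In that tuning a half step is a frequency ratio of 2^(1/12), so its width in hertz grows with pitch rather than being one fixed figure; measuring the tolerance in cents (hundredths of a half step) keeps it the same size in every octave. The function names and the sharps-only spelling are assumptions for illustration.

```python
import math

NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]

def hz_to_note(freq_hz, a4_hz=440.0):
    """Return (note name with octave, deviation in cents) for a frequency."""
    # MIDI-style numbering: A4 is note 69, one unit per half step.
    exact = 69 + 12 * math.log2(freq_hz / a4_hz)
    nearest = round(exact)
    cents_off = 100 * (exact - nearest)
    name = NOTE_NAMES[nearest % 12] + str(nearest // 12 - 1)
    return name, cents_off

def half_step_width_hz(freq_hz):
    """Distance in hertz from freq_hz up to the next half step."""
    return freq_hz * (2 ** (1 / 12) - 1)

if __name__ == "__main__":
    for f in (438.0, 440.0, 442.0, 261.63):
        print(f, hz_to_note(f))
    print(half_step_width_hz(220.0), half_step_width_hz(440.0))
```

With this mapping, 438 and 442 Hz both come out as A4, each a little under 8 cents off, which matches the "440 with a tolerance of 2" example.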
I always thought the "pollution" of a sound track (singer had instruments playing behind him or her) would make it well-nigh impossible for artificial intelligence to pull out the voice only and export to notes. They have stuff for this that you can attach to a clarinet or an oboe, but that's a single sound, not a music recording... But if we had a GUI that showed pitches and the user could choose points through time as "the ones to use" then the program would use those points (which would have hertz values) and export to notation.
For the kind of Indian musical vocal I'm talking about, there will be a very strong melodic line, akin to recording a clarinet sans much else behind it. Outputting to western standard time values (quarter notes, half notes, whole notes) could be dispensed with initially (too big a mountain to climb). I would get the melodic line output and send it to one of our team in Indian Svaram notation and let them enter them into the tala. The Indian system is very simple since it makes no attempt to offer an entire musical staff (chords), but you would just get, a la the old Hypercard music notation (where minus = flat and plus = sharp), output like this:
Mohana raga: c d f g a c c a g f d c or Mayamalavagaula: c c+ e f g g+ b c
(I wrote some stuff in HC for this years ago, brought it into Supercard and then lost it... but I'm sure there is lots of stuff around still playing in this pond.)
Point: the output is a simple linear export of hertz values represented as chars that = musical notes, separated by a space. I believe there is a convention for this linear form, already well established (what you put for next octave or lower octave... etc.), that can even be read and played back as MIDI by Hypercard... but it's been so long...
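The linear export described here could look something like the Python sketch below. It assumes a list of pitch estimates in hertz (with None for chunks where no pitch was found), 12-tone equal temperament with A = 440 Hz, and the letter-plus-sign spelling from the examples above, using "+" for sharps only. Octave markers are left out because the exact convention is not given here, and the function name and the option to collapse repeated notes are illustrative assumptions.

```python
import math

# One letter per half step, starting from c; "+" marks a sharp.
LETTERS = ["c", "c+", "d", "d+", "e", "f", "f+", "g", "g+", "a", "a+", "b"]

def to_linear_notation(freqs_hz, a4_hz=440.0, collapse_repeats=True):
    """Turn a sequence of frequencies into space-separated note letters."""
    notes = []
    for freq in freqs_hz:
        if freq is None or freq <= 0:
            continue  # skip chunks with no usable pitch
        # Half steps relative to A, shifted by 9 so that c maps to index 0.
        index = (round(12 * math.log2(freq / a4_hz)) + 9) % 12
        letter = LETTERS[index]
        if collapse_repeats and notes and notes[-1] == letter:
            continue  # consecutive chunks on the same note count once
        notes.append(letter)
    return " ".join(notes)

if __name__ == "__main__":
    melody = [261.63, 261.63, 277.18, 329.63, 349.23,
              392.00, 415.30, 493.88, 523.25]
    print(to_linear_notation(melody))
```

Collapsing repeats is a trade-off: it stops a held note from printing once per chunk, but it also merges a note that is genuinely sung twice in a row, so a real tool would need timing or amplitude information to tell the two apart.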
Sivakatirswami





On Jan 14, 2006, at 4:49 PM, Mark Smith wrote:
Alejandro, there shouldn't be any problem using it in Windows, or 'Nix for that matter, I just don't have anything but OS X machines to test it on. It's all pure Transcript.
I still have a way to go until it's really finished, but hopefully I'll get it done over the next few days.
Mark
_______________________________________________
use-revolution mailing list
[email protected]
Please visit this url to subscribe, unsubscribe and manage your subscription preferences:
http://lists.runrev.com/mailman/listinfo/use-revolution

