Hi Arthur,

> 1) The first one, and the one I find most interesting, can be to try to
> introduce the map-reduce framework to help to speed up the pairwise
> alignment in the creation of the multiple sequence alignment.

That would be a possible application.

> 2) If the input files are big enough, it can be interesting to perform the
> parsing on these files while using a distributed infrastructure to speed up
> the process,

I am not sure I have encountered such large files yet. Do you have an example?

> 3) Another idea can be to try to have a Hadoopified version of BLAST, in which
> the input file can also be split and then, for each sequence in a chunk,
> the node would perform a local BLAST query.

I agree, another possible application... What frameworks did you think about using?

Andreas

_______________________________________________
Biojava-l mailing list - [email protected]
http://lists.open-bio.org/mailman/listinfo/biojava-l
