Hello, My name is Mihaiu Nicolae, I'm a 2nd year student in Computer Science at the Politehnica University of Bucharest and I'm very interested in working at the "New File Parsers for BioJava" project. I choose BioJava because it blends two passions of mine: coding and biology. Back in highschool biology was one of my favourite subjects, having a very good teacher from whom I learned a lot, I finished every year with 10.
About my knowledge and experience - 1 year and a half experience with Java; it became my first choice in coding; currently I do all my tasks and homework in Java, also developing a bot for aichallenge [1] in Java as a university project. And a little personal project I'm working at, a memory test game, also written in Java. - 5 years of C/C++ - web: HTML, PHP, CSS, MySQL - made a module for my school's website Some thoughts and questions about the project - I took a look at your sources and saw you already have parsers for a lot of files like: FASTA, FASTQ, PDB, mmcif etc. What are the priorities for the new parsers, which is needed most ? - Should we choose only one parser to work on for this project, or the expectations are to implement more than one ? Questions about the "Coding exercise" - About the "ambiguous characters", lets say we have ambiguous DNA. For these two sequences: "ACTATATCGG" and "ATGKMCGW" we should have in one FASTA output file the sequence "ACTATATCGG" and in another one "ATGKMCGW" ? - What do you mean by large, “be capable of reading large files”, because afterwards under “Submission” it says “the test data file named data.fasta up to 10Kb in size” ? Should I understand that 10Kb is the limit for a “large file” ? Best regards, Nicolae [1] http://aichallenge.org _______________________________________________ Biojava-l mailing list - [email protected] http://lists.open-bio.org/mailman/listinfo/biojava-l
