[galaxy-dev] 回复: 回复: 回复: Speed up the galaxy

2012-12-07 Thread 泽 蔡
Hi Alex
 
Now is an another problem. I now deal with two fastq files, there are Illumina 
enconding 1.8 and pair-end, so I don't need to groom. But the fact is, I need 
to use the filter by quality and Fastq interlacer and without groom thses 
two tools can not regonize the files. Any idea to solve this problem?
 


 发件人: Bossers, Alex alex.boss...@wur.nl
收件人: 泽 蔡 caizexi...@yahoo.com.cn 
抄送: galaxy-dev@lists.bx.psu.edu galaxy-dev@lists.bx.psu.edu 
发送日期: 2012年12月5日, 星期三, 4:02 上午
主题: RE: 回复: [galaxy-dev] 回复:  Speed up the galaxy
  
It seems old Illumina encoding 1.5

So yes some tools requiring the fastqsanger would need grooming.

If you are up to programming you can seriously speed this up by using a 
precalculated transfer or hash table.

That way you do not have to do any calculation but just translate each quality 
line using generic regexp/grep/sed like tools or $seq =~ 
t/STARTSCORES/SANGERSCORES/g in perl.

The wiki table could be an alternative option. You still have to set the upload 
type to fastsanger likely since it will probably sniff the header and see its 
fastq but not 1.8+ encoding.



Good luck!

Alex




Van: 泽 蔡 [caizexi...@yahoo.com.cn]
Verzonden: dinsdag 4 december 2012 16:41
To: Bossers, Alex
Cc: galaxy-dev@lists.bx.psu.edu
Onderwerp: 回复: [galaxy-dev] 回复: Speed up the galaxy

Hi Alex

I look the page of wikipedia, but I have a little confused. We sequenced with 
solexa. I paste a read of my data, can you tell me whether I need run Groom?
The read is like this:@HWUSI-EAS1734_0003_FC620JEAAXX:8:1:1174:9013#0/1
AGAAGTACATCGCGATGCCGTTNCCNNCGAAGGCGATAGNNNACAAGNCCAAATGNTTCTNCATCNNNCNCGAGNNGNCGAGGNCGCCGTGCGACCCTGC
+HWUSI-EAS1734_0003_FC620JEAAXX:8:1:1174:9013#0/1
Ya^a`edddeddc\c`a`dc]\Ba^BBZ]ZZ`ZZZ]a]]BBB^[`\UB_V[V\`ZBSZX^

发件人: Bossers, Alex alex.boss...@wur.nl
收件人: 泽 蔡 caizexi...@yahoo.com.cn
抄送: galaxy-dev@lists.bx.psu.edu galaxy-dev@lists.bx.psu.edu; Nate Coraor 
n...@bx.psu.edu
发送日期: 2012年12月4日, 星期二, 下午 11:21
主题: RE: [galaxy-dev] 回复: Speed up the galaxy

Hi

I presume the best way to optimise your current problem is to evaluate whether 
you really need to groom your data!? If its old data presumably yes, but if it 
is recent data in Illumina 1.8+ 
encoding(http://en.wikipedia.org/wiki/FASTQ_format) it is not necessary 
speedup 100% :-)
Groom would take long at our servers as well but due to the new Illumina format 
we didn’t bother to optimise it further by parallelisation for instance...

Alex


Van: galaxy-dev-boun...@lists.bx.psu.edu 
[mailto:galaxy-dev-boun...@lists.bx.psu.edu] Namens ? ?
Verzonden: dinsdag 4 december 2012 16:08
Aan: Nate Coraor
CC: galaxy-dev@lists.bx.psu.edu
Onderwerp: [galaxy-dev] 回复: Speed up the galaxy

Hi,

I read the page of the document. But I don't think there is anything I need. 
Now, I installed a local instance of galaxy, and I just run it on a PC. I need 
to deal with large data, if I just use the default configure of galaxy, every 
task would take long time. For example I ran the FASTQ Groomer with a large 
file, the precedure is so slow, and galaxy would not use the potential of my 
machine. So I want to know how can I get galaxy ran faster. I already can 
upload files quickly and I just need to know how to ran tools quickly.
发件人: Nate Coraor n...@bx.psu.edumailto:n...@bx.psu.edu
收件人: 泽 蔡 caizexi...@yahoo.com.cnmailto:caizexi...@yahoo.com.cn
抄送: galaxy-...@bx.psu.edumailto:galaxy-...@bx.psu.edu 
galaxy-...@bx.psu.edumailto:galaxy-...@bx.psu.edu
发送日期: 2012年12月4日, 星期二, 下午 9:38
主题: Re: [galaxy-dev] Speed up the galaxy

On Dec 4, 2012, at 4:36 AM, 泽 蔡 wrote:

 Hi all,

 How can I speed up the galaxy? Like how to use more cores and memeries.

Hi,

This is a pretty broad question.  However, I would recommend that you start at:

    http://usegalaxy.org/production

--nate

 ___
 Please keep all replies on the list by using reply all
 in your mail client.  To manage your subscriptions to this
 and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/

Re: [galaxy-dev] 回复: 回复: 回复: Speed up the galaxy

2012-12-07 Thread Langhorst, Brad
You can just change the format of the data from fastq to fastqsanger if you're 
sure about the error format (use the pencil, then datatype tab) note: 
fastqsanger !=fastqcsanger

Brad
On Dec 7, 2012, at 3:20 AM, 泽 蔡 
caizexi...@yahoo.com.cnmailto:caizexi...@yahoo.com.cn wrote:

Hi Alex

Now is an another problem. I now deal with two fastq files, there are Illumina 
enconding 1.8 and pair-end, so I don't need to groom. But the fact is, I need 
to use the filter by quality and Fastq interlacer and without groom thses 
two tools can not regonize the files. Any idea to solve this problem?

发件人: Bossers, Alex alex.boss...@wur.nlmailto:alex.boss...@wur.nl
收件人: 泽 蔡 caizexi...@yahoo.com.cnmailto:caizexi...@yahoo.com.cn
抄送: galaxy-dev@lists.bx.psu.edumailto:galaxy-dev@lists.bx.psu.edu 
galaxy-dev@lists.bx.psu.edumailto:galaxy-dev@lists.bx.psu.edu
发送日期: 2012年12月5日, 星期三, 4:02 上午
主题: RE: 回复: [galaxy-dev] 回复: Speed up the galaxy

It seems old Illumina encoding 1.5

So yes some tools requiring the fastqsanger would need grooming.

If you are up to programming you can seriously speed this up by using a 
precalculated transfer or hash table.

That way you do not have to do any calculation but just translate each quality 
line using generic regexp/grep/sed like tools or $seq =~ 
t/STARTSCORES/SANGERSCORES/g in perl.

The wiki table could be an alternative option. You still have to set the upload 
type to fastsanger likely since it will probably sniff the header and see its 
fastq but not 1.8+ encoding.



Good luck!

Alex




Van: 泽 蔡 [caizexi...@yahoo.com.cnmailto:caizexi...@yahoo.com.cn]
Verzonden: dinsdag 4 december 2012 16:41
To: Bossers, Alex
Cc: galaxy-dev@lists.bx.psu.edumailto:galaxy-dev@lists.bx.psu.edu
Onderwerp: 回复: [galaxy-dev] 回复: Speed up the galaxy

Hi Alex

I look the page of wikipedia, but I have a little confused. We sequenced with 
solexa. I paste a read of my data, can you tell me whether I need run Groom?
The read is like this:@HWUSI-EAS1734_0003_FC620JEAAXX:8:1:1174:9013#0/1
AGAAGTACATCGCGATGCCGTTNCCNNCGAAGGCGATAGNNNACAAGNCCAAATGNTTCTNCATCNNNCNCGAGNNGNCGAGGNCGCCGTGCGACCCTGC
+HWUSI-EAS1734_0003_FC620JEAAXX:8:1:1174:9013#0/1
Ya^a`edddeddc\c`a`dc]\Ba^BBZ]ZZ`ZZZ]a]]BBB^[`\UB_V[V\`ZBSZX^

发件人: Bossers, Alex alex.boss...@wur.nlmailto:alex.boss...@wur.nl
收件人: 泽 蔡 caizexi...@yahoo.com.cnmailto:caizexi...@yahoo.com.cn
抄送: galaxy-dev@lists.bx.psu.edumailto:galaxy-dev@lists.bx.psu.edu 
galaxy-dev@lists.bx.psu.edumailto:galaxy-dev@lists.bx.psu.edu; Nate Coraor 
n...@bx.psu.edumailto:n...@bx.psu.edu
发送日期: 2012年12月4日, 星期二, 下午 11:21
主题: RE: [galaxy-dev] 回复: Speed up the galaxy

Hi

I presume the best way to optimise your current problem is to evaluate whether 
you really need to groom your data!? If its old data presumably yes, but if it 
is recent data in Illumina 1.8+ 
encoding(http://en.wikipedia.org/wiki/FASTQ_format) it is not necessary 
speedup 100% :-)
Groom would take long at our servers as well but due to the new Illumina format 
we didn’t bother to optimise it further by parallelisation for instance...

Alex


Van: 
galaxy-dev-boun...@lists.bx.psu.edumailto:galaxy-dev-boun...@lists.bx.psu.edu 
[mailto:galaxy-dev-boun...@lists.bx.psu.edumailto:galaxy-dev-boun...@lists.bx.psu.edu]
 Namens ? ?
Verzonden: dinsdag 4 december 2012 16:08
Aan: Nate Coraor
CC: galaxy-dev@lists.bx.psu.edumailto:galaxy-dev@lists.bx.psu.edu
Onderwerp: [galaxy-dev] 回复: Speed up the galaxy

Hi,

I read the page of the document. But I don't think there is anything I need. 
Now, I installed a local instance of galaxy, and I just run it on a PC. I need 
to deal with large data, if I just use the default configure of galaxy, every 
task would take long time. For example I ran the FASTQ Groomer with a large 
file, the precedure is so slow, and galaxy would not use the potential of my 
machine. So I want to know how can I get galaxy ran faster. I already can 
upload files quickly and I just need to know how to ran tools quickly.
发件人: Nate Coraor 
n...@bx.psu.edumailto:n...@bx.psu.edumailto:n...@bx.psu.edumailto:n...@bx.psu.edu
收件人: 泽 蔡 
caizexi...@yahoo.com.cnmailto:caizexi...@yahoo.com.cnmailto:caizexi...@yahoo.com.cnmailto:caizexi...@yahoo.com.cn
抄送: 
galaxy-...@bx.psu.edumailto:galaxy-...@bx.psu.edumailto:galaxy-...@bx.psu.edumailto:galaxy-...@bx.psu.edu
 
galaxy-...@bx.psu.edumailto:galaxy-...@bx.psu.edumailto:galaxy-...@bx.psu.edumailto:galaxy-...@bx.psu.edu
发送日期: 2012年12月4日, 星期二, 下午 9:38
主题: Re: [galaxy-dev] Speed up the galaxy

On Dec 4, 2012, at 4:36 AM, 泽 蔡 wrote:

 Hi all,

 How can I speed up the galaxy? Like how to use more cores and memeries.

Hi,

This is a pretty broad question.  However, I would recommend that you start at:

http://usegalaxy.org/production

--nate

 ___
 Please keep all replies on the list by using reply all
 in your mail client.  To manage your subscriptions to this
 and 

[galaxy-dev] 回复: 回复: 回复: 回复: Speed up the galaxy

2012-12-07 Thread 泽 蔡
Hi Brad,
 
I'm sorry, I can not quite follow you. The pencil and the datatype tab are 
tools in galaxy?
 


 发件人: Langhorst, Brad langho...@neb.com
收件人: 泽 蔡 caizexi...@yahoo.com.cn 
抄送: galaxy-dev@lists.bx.psu.edu Dev galaxy-dev@lists.bx.psu.edu 
发送日期: 2012年12月7日, 星期五, 7:53 下午
主题: Re: [galaxy-dev] 回复: 回复:  回复:  Speed up the galaxy
  

You can just change the format of the data from fastq to fastqsanger if you're 
sure about the error format (use the pencil, then datatype tab) note: 
fastqsanger !=fastqcsanger 

 
Brad

On Dec 7, 2012, at 3:20 AM, 泽 蔡 caizexi...@yahoo.com.cn wrote: 

Hi Alex 
  
Now is an another problem. I now deal with two fastq files, there are Illumina 
enconding 1.8 and pair-end, so I don't need to groom. But the fact is, I need 
to use the filter by quality and Fastq interlacer and without groom thses 
two tools can not regonize the files. Any idea to solve this problem? 

 


 发件人: Bossers, Alex alex.boss...@wur.nl
收件人: 泽 蔡 caizexi...@yahoo.com.cn 
抄送: galaxy-dev@lists.bx.psu.edu galaxy-dev@lists.bx.psu.edu 
发送日期: 2012年12月5日, 星期三, 4:02 上午
主题: RE: 回复: [galaxy-dev] 回复: Speed up the galaxy
 
It seems old Illumina encoding 1.5

So yes some tools requiring the fastqsanger would need grooming.

If you are up to programming you can seriously speed this up by using a 
precalculated transfer or hash table.

That way you do not have to do any calculation but just translate each quality 
line using generic regexp/grep/sed like tools or $seq =~ 
t/STARTSCORES/SANGERSCORES/g in perl.

The wiki table could be an alternative option. You still have to set the 
upload type to fastsanger likely since it will probably sniff the header and 
see its fastq but not 1.8+ encoding.



Good luck!

Alex




Van: 泽 蔡 [caizexi...@yahoo.com.cn]
Verzonden: dinsdag 4 december 2012 16:41
To: Bossers, Alex
Cc: mailto:galaxy-dev@lists.bx.psu.edu
Onderwerp: 回复: [galaxy-dev] 回复: Speed up the galaxy

Hi Alex

I look the page of wikipedia, but I have a little confused. We sequenced with 
solexa. I paste a read of my data, can you tell me whether I need run Groom?
The read is like this:@HWUSI-EAS1734_0003_FC620JEAAXX:8:1:1174:9013#0/1
AGAAGTACATCGCGATGCCGTTNCCNNCGAAGGCGATAGNNNACAAGNCCAAATGNTTCTNCATCNNNCNCGAGNNGNCGAGGNCGCCGTGCGACCCTGC
+HWUSI-EAS1734_0003_FC620JEAAXX:8:1:1174:9013#0/1
Ya^a`edddeddc\c`a`dc]\Ba^BBZ]ZZ`ZZZ]a]]BBB^[`\UB_V[V\`ZBSZX^

发件人: Bossers, Alex alex.boss...@wur.nl
收件人: 泽 蔡 caizexi...@yahoo.com.cn
抄送: galaxy-dev@lists.bx.psu.edu galaxy-dev@lists.bx.psu.edu; Nate Coraor 
n...@bx.psu.edu
发送日期: 2012年12月4日, 星期二, 下午 11:21
主题: RE: [galaxy-dev] 回复: Speed up the galaxy

Hi

I presume the best way to optimise your current problem is to evaluate whether 
you really need to groom your data!? If its old data presumably yes, but if it 
is recent data in Illumina 1.8+ 
encoding(http://en.wikipedia.org/wiki/FASTQ_format) it is not necessary 
speedup 100% :-)
Groom would take long at our servers as well but due to the new Illumina 
format we didn’t bother to optimise it further by parallelisation for 
instance...

Alex


Van: mailto:galaxy-dev-boun...@lists.bx.psu.edu 
[mailto:galaxy-dev-boun...@lists.bx.psu.edu] Namens ? ?
Verzonden: dinsdag 4 december 2012 16:08
Aan: Nate Coraor
CC: mailto:galaxy-dev@lists.bx.psu.edu
Onderwerp: [galaxy-dev] 回复: Speed up the galaxy

Hi,

I read the page of the document. But I don't think there is anything I need. 
Now, I installed a local instance of galaxy, and I just run it on a PC. I need 
to deal with large data, if I just use the default configure of galaxy, every 
task would take long time.
 For example I ran the FASTQ Groomer with a large file, the precedure is so 
slow, and galaxy would not use the potential of my machine. So I want to know 
how can I get galaxy ran faster. I already can upload files quickly and I just 
need to know how to ran
 tools quickly.
发件人: Nate Coraor n...@bx.psu.edumailto:n...@bx.psu.edu
收件人: 泽 蔡 caizexi...@yahoo.com.cnmailto:caizexi...@yahoo.com.cn
抄送: galaxy-...@bx.psu.edumailto:galaxy-...@bx.psu.edu 
galaxy-...@bx.psu.edumailto:galaxy-...@bx.psu.edu
发送日期: 2012年12月4日, 星期二, 下午 9:38
主题: Re: [galaxy-dev] Speed up the galaxy

On Dec 4, 2012, at 4:36 AM, 泽 蔡 wrote:

 Hi all,

 How can I speed up the galaxy? Like how to use more cores and memeries.

Hi,

This is a pretty broad question.  However, I would recommend that you start at:

    http://usegalaxy.org/production

--nate

 ___
 Please keep all replies on the list by using reply all
 in your mail client.  To manage your subscriptions to this
 and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/






   
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and 

Re: [galaxy-dev] binary data types get corrupted on upload

2012-12-07 Thread John Chilton
Hi Ido,

Is .sqlite a common compressed file type (i.e. zip, gzip, or bz2) just
with a different extension?

If it is, I think upload.py is always going to try to decompress it.
That that has long been a behavior I would like to be able to disable
on a per file type basis. If this is the problem and the galaxy team
agrees to accept a patch for address the issue, I would be happy to
try to implement the fix.

-John

On Fri, Dec 7, 2012 at 8:39 AM, Ido Tamir ta...@imp.ac.at wrote:

 from http://wiki.galaxyproject.org/Admin/Datatypes/Adding%20Datatypes
 I thought that adding subclass=True in datatypes_conf.xml would be enough 
 to register my new
 type, but it seems like I stil have to add an entry in binary.py for it to be 
 recognized. But its still corrupted.

 thank you very much,
 ido

 --
 Tool: Upload File
 Name:   hg19.sqlite
 Created:Dec 07, 2012
 Filesize:   11.1 MB
 Dbkey:  hg19
 Format: sqlite
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


[galaxy-dev] error when looking at tool error

2012-12-07 Thread Andreas Kuntzagk

If I click on the small bug icon for a dataset in error state, I get this 
message in the log:

URL: http://bbc.mdc-berlin.de/galaxy/dataset/errors?id=e96c6ce43a3d239a
File '/data/galaxy/galaxy-dist/eggs/Paste-1.6-py2.6.egg/paste/exceptions/errormiddleware.py', line 
143 in __call__

  app_iter = self.application(environ, start_response)
File '/data/galaxy/galaxy-dist/eggs/Paste-1.6-py2.6.egg/paste/recursive.py', 
line 80 in __call__
  return self.application(environ, start_response)
File 
'/data/galaxy/galaxy-dist/eggs/Paste-1.6-py2.6.egg/paste/httpexceptions.py', 
line 632 in __call__
  return self.application(environ, start_response)
File '/data/galaxy/galaxy-dist/lib/galaxy/web/framework/base.py', line 160 in 
__call__
  body = method( trans, **kwargs )
File 
'/data/galaxy/galaxy-dist/lib/galaxy/webapps/galaxy/controllers/dataset.py', 
line 171 in errors
  hda = trans.sa_session.query( model.HistoryDatasetAssociation ).get( id )
File '/data/galaxy/galaxy-dist/eggs/SQLAlchemy-0.5.6_dev_r6498-py2.6.egg/sqlalchemy/orm/query.py', 
line 507 in get

  return self._get(key, ident)
File '/data/galaxy/galaxy-dist/eggs/SQLAlchemy-0.5.6_dev_r6498-py2.6.egg/sqlalchemy/orm/query.py', 
line 1500 in _get

  return q.all()[0]
File '/data/galaxy/galaxy-dist/eggs/SQLAlchemy-0.5.6_dev_r6498-py2.6.egg/sqlalchemy/orm/query.py', 
line 1267 in all

  return list(self)
File '/data/galaxy/galaxy-dist/eggs/SQLAlchemy-0.5.6_dev_r6498-py2.6.egg/sqlalchemy/orm/query.py', 
line 1361 in __iter__

  return self._execute_and_instances(context)
File '/data/galaxy/galaxy-dist/eggs/SQLAlchemy-0.5.6_dev_r6498-py2.6.egg/sqlalchemy/orm/query.py', 
line 1364 in _execute_and_instances
  result = self.session.execute(querycontext.statement, params=self._params, 
mapper=self._mapper_zero_or_none())
File '/data/galaxy/galaxy-dist/eggs/SQLAlchemy-0.5.6_dev_r6498-py2.6.egg/sqlalchemy/orm/session.py', 
line 755 in execute

  clause, params or {})
File '/data/galaxy/galaxy-dist/eggs/SQLAlchemy-0.5.6_dev_r6498-py2.6.egg/sqlalchemy/engine/base.py', 
line 824 in execute

  return Connection.executors[c](self, object, multiparams, params)
File '/data/galaxy/galaxy-dist/eggs/SQLAlchemy-0.5.6_dev_r6498-py2.6.egg/sqlalchemy/engine/base.py', 
line 874 in _execute_clauseelement

  return self.__execute_context(context)
File '/data/galaxy/galaxy-dist/eggs/SQLAlchemy-0.5.6_dev_r6498-py2.6.egg/sqlalchemy/engine/base.py', 
line 896 in __execute_context

  self._cursor_execute(context.cursor, context.statement, 
context.parameters[0], context=context)
File '/data/galaxy/galaxy-dist/eggs/SQLAlchemy-0.5.6_dev_r6498-py2.6.egg/sqlalchemy/engine/base.py', 
line 950 in _cursor_execute

  self._handle_dbapi_exception(e, statement, parameters, cursor, context)
File '/data/galaxy/galaxy-dist/eggs/SQLAlchemy-0.5.6_dev_r6498-py2.6.egg/sqlalchemy/engine/base.py', 
line 931 in _handle_dbapi_exception

  raise exc.DBAPIError.instance(statement, parameters, e, 
connection_invalidated=is_disconnect)
DataError: (DataError) invalid input syntax for integer: e96c6ce43a3d239a
 'SELECT history_dataset_association.id AS history_dataset_association_id, 
history_dataset_association.history_id AS history_dataset_association_history_id, 
history_dataset_association.dataset_id AS history_dataset_association_dataset_id, 
history_dataset_association.create_time AS history_dataset_association_create_time, 
history_dataset_association.update_time AS history_dataset_association_update_time, 
history_dataset_association.state AS history_dataset_association_state, 
history_dataset_association.copied_from_history_dataset_association_id AS 
history_dataset_association_copied_from_history_dataset_a_1, 
history_dataset_association.copied_from_library_dataset_dataset_association_id AS 
history_dataset_association_copied_from_library_dataset_d_2, history_dataset_association.hid AS 
history_dataset_association_hid, history_dataset_association.name AS 
history_dataset_association_name, history_dataset_association.info AS 
history_dataset_association_info, history_dataset_association.blurb AS 
history_dataset_association_blurb, history_dataset_association.peek AS 
history_dataset_association_peek, history_dataset_association.tool_version AS 
history_dataset_association_tool_version, history_dataset_association.extension AS 
history_dataset_association_extension, history_dataset_association.metadata AS 
history_dataset_association_metadata, history_dataset_association.parent_id AS 
history_dataset_association_parent_id, history_dataset_association.designation AS 
history_dataset_association_designation, history_dataset_association.deleted AS 
history_dataset_association_deleted, history_dataset_association.purged AS 
history_dataset_association_purged, history_dataset_association.visible AS 
history_dataset_association_visible, dataset_1.id AS dataset_1_id, dataset_1.create_time AS 
dataset_1_create_time, dataset_1.update_time AS dataset_1_update_time, dataset_1.state AS 
dataset_1_state, 

Re: [galaxy-dev] binary data types get corrupted on upload

2012-12-07 Thread Ido Tamir
The specification does not mention zip, gzip or bz2 [1] and
I am pretty confident, that it does not in addition compresses
its content with zip etc..
It starts with a header SQLite format 3\000 
which is visible in my file with less.

ido

[1]http://www.sqlite.org/fileformat.html

On Dec 7, 2012, at 3:50 PM, John Chilton wrote:

 Hi Ido,
 
 Is .sqlite a common compressed file type (i.e. zip, gzip, or bz2) just
 with a different extension?
 
 If it is, I think upload.py is always going to try to decompress it.
 That that has long been a behavior I would like to be able to disable
 on a per file type basis. If this is the problem and the galaxy team
 agrees to accept a patch for address the issue, I would be happy to
 try to implement the fix.
 
 -John
 
 On Fri, Dec 7, 2012 at 8:39 AM, Ido Tamir ta...@imp.ac.at wrote:
 
 from http://wiki.galaxyproject.org/Admin/Datatypes/Adding%20Datatypes
 I thought that adding subclass=True in datatypes_conf.xml would be enough 
 to register my new
 type, but it seems like I stil have to add an entry in binary.py for it to 
 be recognized. But its still corrupted.
 
 thank you very much,
 ido
 
 --
 Tool: Upload File
 Name:   hg19.sqlite
 Created:Dec 07, 2012
 Filesize:   11.1 MB
 Dbkey:  hg19
 Format: sqlite


___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] How to Install FastQC?

2012-12-07 Thread greg
Thanks Thon.  What is the TOOLSHED option?

As far as FASTQC, I downloaded and extracted it, but now what?  Do I
dump all of those files and directories in tool-data/shared/jars?

 wget 
 http://www.bioinformatics.babraham.ac.uk/projects/fastqc/fastqc_v0.10.1.zip
 unzip fastqc_v0.10.1.zip
 ls FastQC/
Contaminants  fastqc  fastqc_icon.ico  Help  INSTALL.txt
jbzip2-0.9.jar  LICENSE.txt  README.txt  RELEASE_NOTES.txt
run_fastqc.bat  sam-1.32.jar  Templates  uk

Thanks,

Greg

On Thu, Dec 6, 2012 at 6:40 PM, Anthonius deBoer thondeb...@me.com wrote:
 Hi Greg,

 Indeed you need to install many many tools to make galaxy run.
 You may be better off getting the cloud version, which has many tools
 already installed.
 Getting GALAXY to run with all the tools that are provided is not a trivial
 task...

 It gets MUCH better with the TOOLSHED option, since this can install many of
 the tools you may need...
 the old way of galaxy was to rely on many of the tools to be available on
 the path so they magically run, but things like GATK and FASTQC etc. that
 are run from JAVA jars, need to be installed manually into that directory
 tool-data/shared/jars

 Thon


 On Dec 06, 2012, at 11:36 AM, greg margeem...@gmail.com wrote:

 I'm guessing FastQC isn't installed by default?

 This page give these instructions:
 http://wiki.galaxyproject.org/Admin/Tools/Tool%20Dependencies

 FastQC (0.10.0)NGS: QC and manipulationFastQC from Babraham. The main
 fastqc perl script must be executable from
 tool-data/shared/jars/FastQC/fastqc

 But I don't understand what it means. Do I need to download something,
 change permissions? I don't have anything inside the jars folder.

 Thanks,

 Greg
 ___
 Please keep all replies on the list by using reply all
 in your mail client. To manage your subscriptions to this
 and other Galaxy lists, please use the interface at:

 http://lists.bx.psu.edu/
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] newbie uploader

2012-12-07 Thread Nate Coraor
On Dec 6, 2012, at 7:16 PM, Smith, Therese wrote:

 Dear Sir or Madam,
 We are using DOS prompt ftp, we connected, we sent (8Gbytes), and we 
 received a succcessful completion message from ftp, we saw the file appearing 
 on your page as having been ftp'd to you, however, we received an upload 
 error message, namely:
  
 34: s_8_1_sequence.txt.gz
 error
 An error occurred running this job: Traceback (most recent call last): File 
 /galaxy/home/g2main/galaxy_main/tools/data_source/upload.py, line 384, 
 in__main__() File 
 /galaxy/home/g2main/galaxy_main/tools/data_source/upload.py, line 373, in 
 __main__ add_file( dataset
  
 We are trying again, with binary channel in ftp,and also would be glad of 
 your advice.
  
 Our Galaxy login is:  abdul.ban...@uconn.edu

Hi Therese,

If you could send back an email after your file has finished uploading, but not 
run the upload tool on it, we can check to make sure it's okay on our 
filesystem before you attempt to import, and also test importing it ourselves.

Thanks,
--nate

  
 Sincerely,
 Rouf and Therese
  
 ___
 Please keep all replies on the list by using reply all
 in your mail client.  To manage your subscriptions to this
 and other Galaxy lists, please use the interface at:
 
  http://lists.bx.psu.edu/


___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


Re: [galaxy-dev] error when looking at tool error

2012-12-07 Thread Nate Coraor
On Dec 7, 2012, at 9:54 AM, Andreas Kuntzagk wrote:

 If I click on the small bug icon for a dataset in error state, I get this 
 message in the log:
 
 URL: http://bbc.mdc-berlin.de/galaxy/dataset/errors?id=e96c6ce43a3d239a
 File 
 '/data/galaxy/galaxy-dist/eggs/Paste-1.6-py2.6.egg/paste/exceptions/errormiddleware.py',
  line 143 in __call__
  app_iter = self.application(environ, start_response)
 File '/data/galaxy/galaxy-dist/eggs/Paste-1.6-py2.6.egg/paste/recursive.py', 
 line 80 in __call__
  return self.application(environ, start_response)
 File 
 '/data/galaxy/galaxy-dist/eggs/Paste-1.6-py2.6.egg/paste/httpexceptions.py', 
 line 632 in __call__
  return self.application(environ, start_response)
 File '/data/galaxy/galaxy-dist/lib/galaxy/web/framework/base.py', line 160 in 
 __call__
  body = method( trans, **kwargs )
 File 
 '/data/galaxy/galaxy-dist/lib/galaxy/webapps/galaxy/controllers/dataset.py', 
 line 171 in errors
  hda = trans.sa_session.query( model.HistoryDatasetAssociation ).get( id )
 File 
 '/data/galaxy/galaxy-dist/eggs/SQLAlchemy-0.5.6_dev_r6498-py2.6.egg/sqlalchemy/orm/query.py',
  line 507 in get
  return self._get(key, ident)
 File 
 '/data/galaxy/galaxy-dist/eggs/SQLAlchemy-0.5.6_dev_r6498-py2.6.egg/sqlalchemy/orm/query.py',
  line 1500 in _get
  return q.all()[0]
 File 
 '/data/galaxy/galaxy-dist/eggs/SQLAlchemy-0.5.6_dev_r6498-py2.6.egg/sqlalchemy/orm/query.py',
  line 1267 in all
  return list(self)
 File 
 '/data/galaxy/galaxy-dist/eggs/SQLAlchemy-0.5.6_dev_r6498-py2.6.egg/sqlalchemy/orm/query.py',
  line 1361 in __iter__
  return self._execute_and_instances(context)
 File 
 '/data/galaxy/galaxy-dist/eggs/SQLAlchemy-0.5.6_dev_r6498-py2.6.egg/sqlalchemy/orm/query.py',
  line 1364 in _execute_and_instances
  result = self.session.execute(querycontext.statement, params=self._params, 
 mapper=self._mapper_zero_or_none())
 File 
 '/data/galaxy/galaxy-dist/eggs/SQLAlchemy-0.5.6_dev_r6498-py2.6.egg/sqlalchemy/orm/session.py',
  line 755 in execute
  clause, params or {})
 File 
 '/data/galaxy/galaxy-dist/eggs/SQLAlchemy-0.5.6_dev_r6498-py2.6.egg/sqlalchemy/engine/base.py',
  line 824 in execute
  return Connection.executors[c](self, object, multiparams, params)
 File 
 '/data/galaxy/galaxy-dist/eggs/SQLAlchemy-0.5.6_dev_r6498-py2.6.egg/sqlalchemy/engine/base.py',
  line 874 in _execute_clauseelement
  return self.__execute_context(context)
 File 
 '/data/galaxy/galaxy-dist/eggs/SQLAlchemy-0.5.6_dev_r6498-py2.6.egg/sqlalchemy/engine/base.py',
  line 896 in __execute_context
  self._cursor_execute(context.cursor, context.statement, 
 context.parameters[0], context=context)
 File 
 '/data/galaxy/galaxy-dist/eggs/SQLAlchemy-0.5.6_dev_r6498-py2.6.egg/sqlalchemy/engine/base.py',
  line 950 in _cursor_execute
  self._handle_dbapi_exception(e, statement, parameters, cursor, context)
 File 
 '/data/galaxy/galaxy-dist/eggs/SQLAlchemy-0.5.6_dev_r6498-py2.6.egg/sqlalchemy/engine/base.py',
  line 931 in _handle_dbapi_exception
  raise exc.DBAPIError.instance(statement, parameters, e, 
 connection_invalidated=is_disconnect)
 DataError: (DataError) invalid input syntax for integer: e96c6ce43a3d239a
 'SELECT history_dataset_association.id AS history_dataset_association_id, 
 history_dataset_association.history_id AS 
 history_dataset_association_history_id, 
 history_dataset_association.dataset_id AS 
 history_dataset_association_dataset_id, 
 history_dataset_association.create_time AS 
 history_dataset_association_create_time, 
 history_dataset_association.update_time AS 
 history_dataset_association_update_time, history_dataset_association.state AS 
 history_dataset_association_state, 
 history_dataset_association.copied_from_history_dataset_association_id AS 
 history_dataset_association_copied_from_history_dataset_a_1, 
 history_dataset_association.copied_from_library_dataset_dataset_association_id
  AS history_dataset_association_copied_from_library_dataset_d_2, 
 history_dataset_association.hid AS history_dataset_association_hid, 
 history_dataset_association.name AS history_dataset_association_name, 
 history_dataset_association.info AS history_dataset_association_info, 
 history_dataset_associa!
 tion.blurb AS history_dataset_association_blurb, 
history_dataset_association.peek AS history_dataset_association_peek, 
history_dataset_association.tool_version AS 
history_dataset_association_tool_version, history_dataset_association.extension 
AS history_dataset_association_extension, history_dataset_association.metadata 
AS history_dataset_association_metadata, history_dataset_association.parent_id 
AS history_dataset_association_parent_id, 
history_dataset_association.designation AS 
history_dataset_association_designation, history_dataset_association.deleted AS 
history_dataset_association_deleted, history_dataset_association.purged AS 
history_dataset_association_purged, history_dataset_association.visible AS 
history_dataset_association_visible, dataset_1.id AS dataset_1_id, 

Re: [galaxy-dev] Defunct munge processes using Torque PBS

2012-12-07 Thread Matthew Shirley
Hi Nate. I do understand that this is not a bug directly stemming from the 
Galaxy code base. Munge is really just a tool to pass user credentials between 
systems during job submission to the PBS server. Galaxy is spooling jobs 
through the PBS job runner, which presumably indirectly calls munge through 
jobs submission to the PBS server. I'm just not sure why the munge process is 
sometimes becoming corrupt. This is an issue since I rapidly reach my max 
number of threads for the Galaxy user on my head node. 

At this point I guess I'll try downloading the latest stable version of Torque 
and build RPMs. I have been using what is in EPEL for RHEL6. Thanks for the 
reply, and any other thoughts are still appreciated!

On Dec 7, 2012, at 10:51 AM, Nate Coraor n...@bx.psu.edu
 wrote:

 On Dec 6, 2012, at 2:34 PM, Matthew Shirley wrote:
 
 I am fairly new to PBS management, so I can't rule out some 
 misconfiguration, but I have a strange issue when running Galaxy with the 
 PBS job runner. It seems that munge spawns a bunch of defunct processes 
 after running Galaxy on my cluster:
 
 `ps axjf`:
 
1 25992 25991 25991 ?   -1 Sl   7   8:48 python 
 ./scripts/paster.py serve universe_wsgi.ini --daemon
 25992 26032 25991 25991 ?   -1 Z7   0:00  \_ [munge] 
 defunct
 25992 26034 25991 25991 ?   -1 Z7   0:00  \_ [munge] 
 defunct
 25992 26036 25991 25991 ?   -1 Z7   0:00  \_ [munge] 
 defunct
 
 Now, these processes are being spawned by Galaxy, and I can't figure out 
 why. Can anyone provide some insight or clues about where to start debugging 
 this? Thanks,
 
 Hi Matt,
 
 I'm not sure what munge is, it's not something provided with Galaxy.  
 Googling suggests it might be an authentication tool used in some HPC 
 environments.  Without having any familiarity with it, I can't say what 
 process in Galaxy would be interacting with it, especially since that 
 interaction must occur implicitly somewhere down the chain of normal Galaxy 
 operations.
 
 --nate
 
 
 Matt
 
 ___
 Please keep all replies on the list by using reply all
 in your mail client.  To manage your subscriptions to this
 and other Galaxy lists, please use the interface at:
 
 http://lists.bx.psu.edu/
 
 



___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


[galaxy-dev] Cannot see data when selecting Display Data in Browser

2012-12-07 Thread Dave Walton
I'm seeing some weird behavior in our local galaxy instance and am wondering if 
anyone has seen anything similar.

I have a custom tool that I've written that generates a tab-delimted file.  
When I click on the item name in the history, I can see the contents in the 
preview window.  However, when I click on the eye (Display Data in Browser), 
it comes up empty.  Same thing if I download the file.  However, if I go to the 
database directory to examine the file on disk, the content is exactly what I 
would expect.  The permissions on the file are galaxy user/group and the file 
has permissions of 644.

I don't see any reason why I shouldn't be able to view or download this file 
through galaxy.

Any ideas?

Thanks,

Dave

---
Dave Walton
Manager Scientific Computing
The Jackson Laboratory
Bar Harbor, Maine
dave.wal...@jax.org

The information in this email, including attachments, may be confidential and 
is intended solely for the addressee(s). If you believe you received this email 
by mistake, please notify the sender by return email as soon as possible.
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/

Re: [galaxy-dev] Cannot see data when selecting Display Data in Browser

2012-12-07 Thread Dannon Baker
Hey Dave,

What revision are you running locally?  And, just to confirm, in galaxy the 
file is recognized as a 'tabular' file type?

-Dannon


On Dec 7, 2012, at 1:25 PM, Dave Walton dave.wal...@jax.org wrote:

 I'm seeing some weird behavior in our local galaxy instance and am wondering 
 if anyone has seen anything similar.
 
 I have a custom tool that I've written that generates a tab-delimted file.  
 When I click on the item name in the history, I can see the contents in the 
 preview window.  However, when I click on the eye (Display Data in 
 Browser), it comes up empty.  Same thing if I download the file.  However, if 
 I go to the database directory to examine the file on disk, the content is 
 exactly what I would expect.  The permissions on the file are galaxy 
 user/group and the file has permissions of 644. 
 
 I don't see any reason why I shouldn't be able to view or download this file 
 through galaxy.
 
 Any ideas?
 
 Thanks,
 
 Dave
 
 ---
 Dave Walton
 Manager Scientific Computing
 The Jackson Laboratory
 Bar Harbor, Maine
 dave.wal...@jax.org
 The information in this email, including attachments, may be confidential and 
 is intended solely for the addressee(s). If you believe you received this 
 email by mistake, please notify the sender by return email as soon as 
 possible.
 ___
 Please keep all replies on the list by using reply all
 in your mail client.  To manage your subscriptions to this
 and other Galaxy lists, please use the interface at:
 
  http://lists.bx.psu.edu/

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/


[galaxy-dev] postgres connection

2012-12-07 Thread Coldren, Christopher D
Galaxy dev:

I'm installing a local galaxy instance (Dec 4, 2012 vintage) and it is running 
well under mySQL, but I'm having trouble getting it to work with a postgres 
database.

I've done the following to create the db and provide permissions:

$ whoami
galaxy
$ createdb galaxy_prod4
$ psql galaxy_prod4
psql (8.4.13)
Type help for help.
galaxy_prod4=# CREATE ROLE galaxy_prod_role4 WITH PASSWORD 'milkweed';
CREATE ROLE
galaxy_prod4=# GRANT ALL PRIVILEGES ON DATABASE galaxy_prod4 to 
galaxy_prod_role4;
GRANT
galaxy_prod4=# ALTER ROLE galaxy_prod_role4 LOGIN;
ALTER ROLE
galaxy_prod4=# \du
List of roles
 Role name |  Attributes  | Member of
---+--+---
 galaxy| Superuser| {}
   : Create role
   : Create DB
 galaxy_prod_role  | Cannot login | {}
 galaxy_prod_role4 |  | {}
 mydb_role | Cannot login | {}
 mydb_user |  | {}
 postgres  | Superuser| {}
   : Create role
   : Create DB
galaxy_prod4=# \l
   List of databases
 Name |  Owner   | Encoding |  Collation  |Ctype|  Access 
privileges
--+--+--+-+-+--
 galaxy_prod3 | galaxy   | UTF8 | en_US.UTF-8 | en_US.UTF-8 | =Tc/galaxy
: 
galaxy=CTc/galaxy
: 
galaxy_prod_role=CTc/galaxy
 galaxy_prod4 | galaxy   | UTF8 | en_US.UTF-8 | en_US.UTF-8 | =Tc/galaxy
: 
galaxy=CTc/galaxy
: 
galaxy_prod_role4=CTc/galaxy
 postgres | postgres | UTF8 | en_US.UTF-8 | en_US.UTF-8 |
 template0| postgres | UTF8 | en_US.UTF-8 | en_US.UTF-8 | =c/postgres
: 
postgres=CTc/postgres
 template1| postgres | UTF8 | en_US.UTF-8 | en_US.UTF-8 | =c/postgres
: 
postgres=CTc/postgres
(5 rows)


Looks promising, right?

My universe_wsgi.ini has the following entry:


database_connection = 
postgres:///galaxy_prod_role4:milkweed@127.0.0.1:5432/galaxy_prod4

and started the application, but it failed with:

OperationalError: (OperationalError) FATAL:  database 
galaxy_prod_role4:milkweed@127.0.0.1mailto:milkweed@127.0.0.1:5432/galaxy_prod4
 does not exist

commenting the database_connection line out allows galaxy to start, using mySQL 
I presume. Any ideas? This is my first time configuring something like this.

Thank you!
Chris
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/