[galaxy-dev] Galaxy web interface falls over when running TopHat

2011-12-10 Thread David Matthews
Hi,

We seem to have sorted out the problem of TopHat failing to run but now we have 
a new problem. When TopHat runs with a large genome (but not with small 
genomes), it finishes the run fine and all the data is there but the web 
interface falls over about 8 hours into the run and when we try to access the 
web based interface we get a Proxy Error. When we restart it all looks fine. 
This is the sort of errors our HPC people find:

Exception happened during processing of request from ('XXX.XXX.XXX.XXX', 32960)
Traceback (most recent call last):
  File /gpfs/cluster/isys/galaxy/Galaxy/galaxy-dist/eggs/Paste-1.6-py2.6.egg/pa
ste/httpserver.py, line 1053, in process_request_in_thread
Unexpected exception in worker function lambda at 0x201caed8
Traceback (most recent call last):
  File /gpfs/cluster/isys/galaxy/Galaxy/galaxy-dist/eggs/Paste-1.6-py2.6.egg/pa
ste/httpserver.py, line 863, in worker_thread_callback
Unhandled exception in thread started by bound method Thread.__bootstrap of Th
read(worker 118, stopped 1127647552)
Traceback (most recent call last):


Do you have any suggestions for what is happening?


Best Wishes,
David.

__
Dr David A. Matthews

Senior Lecturer in Virology
Room E49
Department of Cellular and Molecular Medicine,
School of Medical Sciences
University Walk,
University of Bristol
Bristol.
BS8 1TD
U.K.

Tel. +44 117 3312058
Fax. +44 117 3312091

d.a.matth...@bristol.ac.uk






___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/

[galaxy-dev] How and where to install tool dependencies

2011-12-10 Thread Josh Nielsen
Hello,

I have a question which I have not seen specifically addressed in the
online Galaxy wiki documentation about how to integrate tools
(dependencies) into Galaxy. I have implemented a locally managed instance
of Galaxy that my business is using with our cluster and now have a freshly
installed and configured instance of Galaxy running. It is bare-bones right
now and I did not use mercurial to sync any existing files/directory
structures. I have seen the page on external tool dependencies (
http://wiki.g2.bx.psu.edu/Admin/Tools/Tool%20Dependencies) needed for
Galaxy, but I am somewhat unsure where to place the tools to utilize them
as intended (other than through trial  error).

It appears that there are shell directories for the tools under
~/galaxy-dist/tools/ with basic wrapper scripts but without the
corresponding executables (very few that I've noticed have the tools
already in them). Is the intent to download the dependency tools and
(building from source if necessary) take the binaries in those directories
and copy them to their corresponding directory under ~/galaxy-dist/tools/?
This seems to have worked with an error I first got when clipping a FASTQ
file which reported that fastx_clipper was not a recognized command. So I
downloaded the FASTX Toolkit, compiled the binaries, and copied only the
binaries into the corresponding fastx tools directory. Would I do the same
thing for TopHat and Cufflinks by taking all their binaries (combined) and
copying them into ~/galaxy-dist/tools/ngs_rna/?

Even if that is the case though, I have occasionally gotten errors about
tools missing in completely different directories. One was for the FASTQ
Groomer. One user saw this error in their browser (which for now is the
only way I know to figure out where tools are *expected* to be):

*File /home/galaxy/galaxy-dist/tools/rgenetics/rgFastQC.py, line 141, in
assert os.path.isfile(opts.executable),'##rgFastQC.py error - cannot find
executable %s' % opts.executable AssertionError: ##rgFastQC.py error -
cannot find executable
/home/galaxy/galaxy-dist/tool-data/shared/jars/FastQC/fastqc
*

To fix this I downloaded the FastQC tar file from its webpage, unzipped it,
and copied the fastqc binary/script to the home/galaxy/galaxy
-dist/tool-data/shared/jars/FastQC/** directory. I also had to mkdir
FastQC/ under jars/ to place it there since it didn't already exist. Had I
not been told the specific directory by the error I'm not sure how I would
have intuitively known to place the binary there (unless I'm overlooking
some critical documentation). And how do I know that other similar things
are not missing which should be there? Can anyone shed some light on this
please? Adding a brief page on the Galaxy wiki site under the Admin section
about this would really help, even if it only showed an example for one or
two specific tools.

Thanks,
Josh Nielsen
___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/

Re: [galaxy-dev] Galaxy web interface falls over when running TopHat

2011-12-10 Thread Jeremy Goecks
David,

What is the full stack trace that you're seeing? The stack trace is the text 
below Traceback and identifies the exact location where the problem occurs. 

Are you following these guidelines:

http://wiki.g2.bx.psu.edu/Admin/NGS%20Local%20Setup

 for setting up large genomes in Galaxy?

Also, it would be ideal if you could upload the problematic data--genome + 
reads--to our public instance (main.g2.bx.psu.edu) and see if you can reproduce 
the problem that you're seeing.

Thanks,
J.

On Dec 9, 2011, at 12:20 PM, David Matthews wrote:

 Hi,
 
 We seem to have sorted out the problem of TopHat failing to run but now we 
 have a new problem. When TopHat runs with a large genome (but not with small 
 genomes), it finishes the run fine and all the data is there but the web 
 interface falls over about 8 hours into the run and when we try to access the 
 web based interface we get a Proxy Error. When we restart it all looks fine. 
 This is the sort of errors our HPC people find:
 
 Exception happened during processing of request from ('XXX.XXX.XXX.XXX', 
 32960)
 Traceback (most recent call last):
   File 
 /gpfs/cluster/isys/galaxy/Galaxy/galaxy-dist/eggs/Paste-1.6-py2.6.egg/pa
 ste/httpserver.py, line 1053, in process_request_in_thread
 Unexpected exception in worker function lambda at 0x201caed8
 Traceback (most recent call last):
   File 
 /gpfs/cluster/isys/galaxy/Galaxy/galaxy-dist/eggs/Paste-1.6-py2.6.egg/pa
 ste/httpserver.py, line 863, in worker_thread_callback
 Unhandled exception in thread started by bound method Thread.__bootstrap of 
 Th
 read(worker 118, stopped 1127647552)
 Traceback (most recent call last):
 
 
 Do you have any suggestions for what is happening?
 
 
 Best Wishes,
 David.
 
 __
 Dr David A. Matthews
 
 Senior Lecturer in Virology
 Room E49
 Department of Cellular and Molecular Medicine,
 School of Medical Sciences
 University Walk,
 University of Bristol
 Bristol.
 BS8 1TD
 U.K.
 
 Tel. +44 117 3312058
 Fax. +44 117 3312091
 
 d.a.matth...@bristol.ac.uk
 
 
 
 
 
 
 ___
 Please keep all replies on the list by using reply all
 in your mail client.  To manage your subscriptions to this
 and other Galaxy lists, please use the interface at:
 
  http://lists.bx.psu.edu/

___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/

Re: [galaxy-dev] How and where to install tool dependencies

2011-12-10 Thread Jeremy Goecks
Josh,

 It appears that there are shell directories for the tools under 
 ~/galaxy-dist/tools/ with basic wrapper scripts but without the corresponding 
 executables (very few that I've noticed have the tools already in them). Is 
 the intent to download the dependency tools and (building from source if 
 necessary) take the binaries in those directories and copy them to their 
 corresponding directory under ~/galaxy-dist/tools/? This seems to have worked 
 with an error I first got when clipping a FASTQ file which reported that 
 fastx_clipper was not a recognized command. So I downloaded the FASTX 
 Toolkit, compiled the binaries, and copied only the binaries into the 
 corresponding fastx tools directory. Would I do the same thing for TopHat and 
 Cufflinks by taking all their binaries (combined) and copying them into 
 ~/galaxy-dist/tools/ngs_rna/?

You'll want to read about Galaxy Tool files a bit to understand the files in 
~/galaxy-dist/tools:

http://wiki.g2.bx.psu.edu/Admin/Tools/Tool%20Config%20Syntax#Admin.2BAC8-Tools.2BAC8-Tool_Config_Syntax.Galaxy_Tool_XML_File

These are not shell directories; instead, they include tool config files + 
additional wrapper scripts to run a tool in Galaxy.

To answer your question, executables for tools need to be in your path but do 
not need to be in the config/wrapper directories. For example, in an SGE 
cluster, we suggest setting the PATH environment var in ~/.sge_request

 Even if that is the case though, I have occasionally gotten errors about 
 tools missing in completely different directories. One was for the FASTQ 
 Groomer. One user saw this error in their browser (which for now is the only 
 way I know to figure out where tools are *expected* to be):
 
 File /home/galaxy/galaxy-dist/tools/rgenetics/rgFastQC.py, line 141, in 
 assert os.path.isfile(opts.executable),'##rgFastQC.py error - cannot find 
 executable %s' % opts.executable AssertionError: ##rgFastQC.py error - cannot 
 find executable /home/galaxy/galaxy-dist/tool-data/shared/jars/FastQC/fastqc 

The exception to the above is Java-based tools. For these tools, you'll need to 
use the ~/galaxy-dist/shared/jars directory. This is a limitation of Galaxy 
that will likely be addressed in the future.

 Adding a brief page on the Galaxy wiki site under the Admin section about 
 this would really help, even if it only showed an example for one or two 
 specific tools.

I looked a bit but couldn't find it; I suspect it is out on the wiki somewhere, 
though clearly it needs to be easier to find.

Good luck,
J.


___
Please keep all replies on the list by using reply all
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:

  http://lists.bx.psu.edu/