Re: Where can I store data files in a tomcat war

Paul Taylor Wed, 02 Jul 2014 03:50:28 -0700


I guess I'm a little confused as to what this means.


I have a simple WAR based web application that uses Lucene created
indexes to provide search results in a xml format.

Especially given the following context:

and supplementary question how do I modify my pom file to do this
with maven

I was under the impression that Paul was building a separate
application using Lucene during the build stage to create the
indexes, but then using an application - specific mechanism to use
those indexes.

That's what I thought, too.

Yes correct, let me explain it a bit further. I'm trying to deploy anapplication that serves results from a lucene index in response to userrequests. Deploying it manually to my own server is fine, first of all Ijust copy the index files to a location on the disk, then I deploy myapplication, and within its web.xml I have a servlet parameter thatdefines where the indexes are, so within the servlets init() method iinitilize the indexes. The problem is that I'm trying to deploy myapplication to Amazon Web Services using autoscaled Elastic Beanstalk,this means that the application has to be able to be initilized andcreated based on what is in the war because Elastic Beanstalk willautomatically start new servers as required due to load and terminatethose instances when not required.

I do seem to have a solution, but I detail it here because it doesn'tseem quite right and might be useful to others.


Short Answer:

Originally I first tried putting the index files (unzipped) into thesrc/main/resources folder of my maven project, and referred to theWEB-INF/classes/index_dir location in my web.xml and tomcat didn'tstart. It didnt seem right for non Java classes to be in that folderanyway so I discarded that idea, however Ive just tried it again locallyand it worked so if it works on EB that is the solution I'm going to usefor now unless any better suggestions. It does mean that the resulting.war file is rather large, far too large to upload from my localmachine but as I build the code and indexes from another AWS EC2instance I can just dump it into S3, and deploy from S3 to EB, if I needto redeploy you dont seem able to redeploy from S3 but Ive realised thatwhen I need to redeploy I would do it to a new EB configuration and thenswap the dns from EB1 to EB2 to mimimize downtime so that is not reallya problem.


A supplementary question:

Is there a system property I can use to refer to the WEB-INF as arelative directory rather than full path


Long Answer:

Since originally posting this question I have looked at a few otherpossible solutions but none were satisfactory.

1. Deploy war without indexes but in my servlet init() method write codeto grab the compressed indexes from S3 and unzip to location specifiedin web.xml. This worked with a single instance EB but unfortunately AWSdoes not wait for the init() method (which takes 20 minutes) to finishbefore declaring it, and this meant because it was busy unzippingindexes and could not serve request it caused AWS monitoring to declareit to busy and open another two instances, once all three instancesfinished their init() method they were all up and working , then a fewminutes two were terminated because not needed. But this means if serveris genuinely busy the newly started instances will be declared ready byAWS but fail to service requests during the init() period. This seemslike a bug with AWS but not going to change anytime soon.

2. Deploy war without indexes and use AWS .ebextensions files to graband unzip the indexes. This might work but I really dislike having towrite custom deployment code/configurations as a general rule. Andbecause the size of the disk provided by the AWS instanceis limited, unzipping is not so simple. For example instead of creatinga tar.gz file , I had to gzip the files first and then tar so whenuntarrred I could decompress one file at a time which required lesstemporaray space, this would make the eb code more complex.

3. Create a custom Amazon Image that can be used by EB, this seemstheoretically possible but quickly got very messy and seemed very much ahack.

4. Use Docker, AWS now supports the docker framework. This might be agood solution but having spent far too much time on understanding AWS Iwasnt keen to spen dmore time on yet another framework to solve one problem

If the Lucene API is used, then writing a servlet context listener
that digs out the initial indexes and places them in java.io.tmpdir
in a known subdirectory is probably the way to go. This ensures
that even if a WAR file is not exploded, the Lucene DirectoryReader
API can get to the files.

That's precisely what I was suggesting.

So this is what I did with 1> but because of the AWS issue didnt work aswell as hoped.


Paul

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscr...@tomcat.apache.org
For additional commands, e-mail: users-h...@tomcat.apache.org

Re: Where can I store data files in a tomcat war

Reply via email to