[ 
https://issues.apache.org/jira/browse/HADOOP-7614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13211093#comment-13211093
 ] 

Steve Loughran commented on HADOOP-7614:
----------------------------------------

Configurations last a long time and can get passed around, so I'm reluctant to 
do anything that could leak things or even add XML files to the in-memory 
configuration.

Removing the behaviour altogether would break things (it's hard to tell from 
the general Object interface).

I think in this situation, handling reloading by doing nothing may be the best 
tactic, possibly warning the user somehow. Once an inputstream has been loaded, 
the {{loadDefaults}} flag could be set to false. Yet that would change 
behaviour in copied configurations, which will not crash on a reload.
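
A minimal sketch of one reading of the "do nothing on reload" idea, in plain 
JDK Java. The class and its method names are hypothetical and do not 
correspond to the real Configuration internals; the sketch only shows a stream 
being read once, kept, and then skipped with a warning on later reloads.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Collections;
import java.util.IdentityHashMap;
import java.util.List;
import java.util.Set;

/** Hypothetical stand-in for a configuration holder; not Hadoop's Configuration. */
final class OneShotStreamResources {
    private final List<InputStream> streams = new ArrayList<>();
    // Streams that were already read, tracked by identity rather than equals().
    private final Set<InputStream> consumed =
        Collections.newSetFromMap(new IdentityHashMap<>());
    // Text obtained from each stream on its first (and only) read.
    private final List<String> loaded = new ArrayList<>();
    private int skipped = 0;

    void addResource(InputStream in) {
        streams.add(in);
    }

    /** Reads each stream at most once; later reloads skip it and warn. */
    void reload() {
        for (InputStream in : streams) {
            if (!consumed.add(in)) {
                skipped++;
                System.err.println("WARN: stream resource already read; skipping it on reload");
                continue;
            }
            loaded.add(readFully(in));
        }
    }

    int loadedCount() {
        return loaded.size();
    }

    int skippedCount() {
        return skipped;
    }

    private static String readFully(InputStream in) {
        try {
            ByteArrayOutputStream buffer = new ByteArrayOutputStream();
            byte[] chunk = new byte[4096];
            int n;
            while ((n = in.read(chunk)) != -1) {
                buffer.write(chunk, 0, n);
            }
            return new String(buffer.toByteArray(), StandardCharsets.UTF_8);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

Calling reload() twice on an instance holding one stream reads that stream 
once and skips it the second time, instead of handing an exhausted stream to 
a parser again.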

                
> Reloading configuration when using inputstream resources results in 
> org.xml.sax.SAXParseException
> -------------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-7614
>                 URL: https://issues.apache.org/jira/browse/HADOOP-7614
>             Project: Hadoop Common
>          Issue Type: Bug
>          Components: conf
>    Affects Versions: 0.21.0
>            Reporter: Ferdy Galema
>            Priority: Minor
>         Attachments: HADOOP-7614-v1.patch, HADOOP-7614-v2.patch
>
>
> When using an inputstream as a resource for configuration, reloading this 
> configuration will throw the following exception:
> Exception in thread "main" java.lang.RuntimeException: org.xml.sax.SAXParseException: Premature end of file.
>       at org.apache.hadoop.conf.Configuration.loadResource(Configuration.java:1576)
>       at org.apache.hadoop.conf.Configuration.loadResources(Configuration.java:1445)
>       at org.apache.hadoop.conf.Configuration.getProps(Configuration.java:1381)
>       at org.apache.hadoop.conf.Configuration.get(Configuration.java:569)
> ...
> Caused by: org.xml.sax.SAXParseException: Premature end of file.
>       at com.sun.org.apache.xerces.internal.parsers.DOMParser.parse(DOMParser.java:249)
>       at com.sun.org.apache.xerces.internal.jaxp.DocumentBuilderImpl.parse(DocumentBuilderImpl.java:284)
>       at javax.xml.parsers.DocumentBuilder.parse(DocumentBuilder.java:124)
>       at org.apache.hadoop.conf.Configuration.loadResource(Configuration.java:1504)
>       ... 4 more
> To reproduce, see the following test code:
>     Configuration conf = new Configuration();
>     ByteArrayInputStream bais = new ByteArrayInputStream("<configuration></configuration>".getBytes());
>     conf.addResource(bais);
>     System.out.println(conf.get("blah"));
>     conf.addResource("core-site.xml"); // just add a named resource, doesn't matter which one
>     System.out.println(conf.get("blah"));
> Allowing inputstream resources is flexible, but in cases such as this it can 
> lead to difficult-to-debug problems.
> What do you think is the best solution? We could:
> A) reset the inputstream after it is read instead of closing it (but what to 
> do when the stream does not support marking?)
> B) leave it up to the client (for example, make sure you implement close() so 
> that it resets the stream)
> C) when reading the inputstream for the first time, cache or wrap the 
> contents somehow so that it can be read multiple times (let's at least 
> document it)
> D) remove the inputstream method altogether
> E) something else?
> For now I have attached a patch for solution A.
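
Option C above can be sketched roughly as follows, using only the JDK. The 
CachedStreamResource class and its methods are hypothetical names chosen for 
illustration; this is not the attached patch and not part of Hadoop. It 
assumes the stream is small enough to buffer fully in memory.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;

/** Hypothetical helper illustrating option C; not part of Hadoop. */
final class CachedStreamResource {
    // Full contents of the original stream, captured on construction.
    private final byte[] contents;

    CachedStreamResource(InputStream in) {
        this.contents = readFully(in);
    }

    /** Returns a fresh stream over the cached bytes; safe to call repeatedly. */
    InputStream open() {
        return new ByteArrayInputStream(contents);
    }

    /** Cached contents decoded as UTF-8 (an assumption made for this sketch). */
    String text() {
        return new String(contents, StandardCharsets.UTF_8);
    }

    int size() {
        return contents.length;
    }

    // Drains and closes the source stream, returning everything it produced.
    private static byte[] readFully(InputStream in) {
        try (InputStream source = in) {
            ByteArrayOutputStream buffer = new ByteArrayOutputStream();
            byte[] chunk = new byte[4096];
            int n;
            while ((n = source.read(chunk)) != -1) {
                buffer.write(chunk, 0, n);
            }
            return buffer.toByteArray();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

Each reload would then parse open() rather than the original stream, so the 
parser never sees an exhausted stream. The cost is that the raw XML stays in 
memory for as long as the wrapper is referenced.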
