[
https://issues.apache.org/jira/browse/TIKA-2306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15968600#comment-15968600
]
ASF GitHub Bot commented on TIKA-2306:
--------------------------------------
thammegowda commented on a change in pull request #163: TIKA-2306: Update
Inception v3 to Inception v4 in Object recognition parser
URL: https://github.com/apache/tika/pull/163#discussion_r111525259
##########
File path:
tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/InceptionRestDockerfile
##########
@@ -16,26 +16,34 @@
#
FROM ubuntu
-MAINTAINER Thamme Gowda <[email protected]>
+MAINTAINER Thamme Gowda <[email protected]> and Kranthi Kiran GV
<[email protected]>
-# install missing part of ubuntu core + python stuff
+# Install missing part of ubuntu core + python stuff
RUN apt-get update && \
- apt-get install -y python-pip python-dev wget
+ apt-get install -y python-pip python-dev wget libtcmalloc-minimal4 git
# Install tensorflow and other dependencies
RUN \
- pip install --upgrade
https://storage.googleapis.com/tensorflow/linux/cpu/tensorflow-0.9.0-cp27-none-linux_x86_64.whl
&& \
+ pip install --upgrade
https://storage.googleapis.com/tensorflow/linux/cpu/tensorflow-1.0.1-cp27-none-linux_x86_64.whl
&& \
pip install flask requests
+# Get the TF-slim dependencies
+# Forked for future compatibility
+RUN git clone https://github.com/KranthiGV/models
+
# TODO: Change the URL to Apache/Tika Repo when this PR gets merged
RUN \
- wget
https://raw.githubusercontent.com/thammegowda/tika/TIKA-1993/tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/inceptionapi.py
-O /usr/bin/inceptionapi.py && \
+ wget
https://raw.githubusercontent.com/KranthiGV/tika/TIKA-2306/tika-parsers/src/main/resources/org/apache/tika/parser/recognition/tf/inceptionapi.py
-O /usr/bin/inceptionapi.py && \
Review comment:
TODO: when we merge this PR, we modify it and then merge it
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
> Update Inception v3 to Inception v4 in Object recognition parser
> -----------------------------------------------------------------
>
> Key: TIKA-2306
> URL: https://issues.apache.org/jira/browse/TIKA-2306
> Project: Tika
> Issue Type: Improvement
> Components: parser
> Affects Versions: 1.14
> Reporter: Kranthi Kiran GV
> Priority: Minor
> Labels: inception, object_recognition
> Original Estimate: 72h
> Remaining Estimate: 72h
>
> Object Recognition Parser currently uses Inception V3 model for the object
> classification task. Google released a newer Inception V4 model [1][2].
> It has an improved Top -1 accuracy of 80.2 and Top-5 accuracy of 95.2 [3].
> I believe that Tika community would benefit from it. I would be working on
> this issue in the next few days.
> [1] https://research.googleblog.com/2016/08/improving-inception-and-image.html
> [2] https://arxiv.org/abs/1602.07261
> [3] https://github.com/tensorflow/models/tree/master/slim
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)