Dear Wiki user, You have subscribed to a wiki page or wiki category on "Tika Wiki" for change notification.
The "ContentMimeDetection" page has been changed by Lukeliush: https://wiki.apache.org/tika/ContentMimeDetection?action=diff&rev1=13&rev2=14 Also please note the model has to be ready before it can be used in Tika; by "ready", we mean the model has to pass the final knowledge evaluation test. As shall be seen shortly, as an example Tika is only implementing the prediction phase, so the model parameters need to be loaded and read into Tika for prediction or classification; The process of training can be lengthy and tedious, sometimes training might require parallel computation on e.g. map-reduce when training data is too large to fit memory, again this depends on the user's goal. - ''The following will briefly walk you through how the feature and example is implemented in this data problem. Please refer to the [[https://github.com/LukeLiush/filetypeDetection/blob/master/Documenation_NNModelIntegrationWithTika.docx |documentation]]for further details with the R implemenation.'' + ''The following will briefly walk you through how the feature and example is implemented in this data problem. Please also refer to the following link for more information of the implemenation in R.'' + + ''https://github.com/LukeLiush/filetypeDetection/blob/master/Documenation_NNModelIntegrationWithTika.docx '' Please also refer to the code repo for details of implementation for training or preparing for a model, the neural network and logistic regression learning are implemented in R and the following describes the pre processing and learning implementation in R.
