Dear Wiki user, You have subscribed to a wiki page or wiki category on "Tika Wiki" for change notification.
The "ContentMimeDetection" page has been changed by Lukeliush: https://wiki.apache.org/tika/ContentMimeDetection?action=diff&rev1=11&rev2=12 The following line in main.R is the last line used to output the model, the name and structure can be customized according to different relish. - {{https://lh4.googleusercontent.com/9NhU8MSntrg9JRxV55sG89v5MkBM_ZzI9wo5SoYN3chzirIB_R97VImM4LUc6Cps1wJSfDlZCAE-OdCCj6OGBmeGHyKn8falen0APY1UY0B4xgCZ1EUEX3JVYcqxznNEQ2ygXpw||transform="rotate(0.00rad)",height="27px;",width="602px;",-webkit-transform="rotate(0.00rad)",border="none"}} + {{https://lh4.googleusercontent.com/9NhU8MSntrg9JRxV55sG89v5MkBM_ZzI9wo5SoYN3chzirIB_R97VImM4LUc6Cps1wJSfDlZCAE-OdCCj6OGBmeGHyKn8falen0APY1UY0B4xgCZ1EUEX3JVYcqxznNEQ2ygXpw||height="27px;",width="602px;"}} The exportNNParams method implementation resides in the utility class i.e. ‘myfunctions.R’; it can be also customized or replaced to create your own model file with different syntax or structure. @@ -139, +139 @@ The next line without # at the front shows a series of floating numbers separated by a tab, and they are model parameters, later we need to import the file into Tika and have the ExampleNNModelDetector to recreate the trained model with them in Tika so it can predict and classify the unseen file and determine with the imported model whether the given input file is a GRB or non-GRB type. - {{https://lh6.googleusercontent.com/ZkRhFs9ON4ELXTtClE9s0frCEsC_i7ktsWkmGlm10ktOCpJMorMB_UZA2K4pp6LIc8AK0c2LKhgss7ZQkhTop4eh9BBDYn-kQlC17PB21VUdMYjtvpHbUjY51XyS2iOgxSYjUIo||transform="rotate(0.00rad)",height="43px;",width="602px;",-webkit-transform="rotate(0.00rad)",border="none"}} + {{https://lh6.googleusercontent.com/ZkRhFs9ON4ELXTtClE9s0frCEsC_i7ktsWkmGlm10ktOCpJMorMB_UZA2K4pp6LIc8AK0c2LKhgss7ZQkhTop4eh9BBDYn-kQlC17PB21VUdMYjtvpHbUjY51XyS2iOgxSYjUIo||height="43px;",width="602px;"}} The following shows the printing formation produced by the R program after training in a bit more detail with the outputted/chosen model above. . [1] "Loading Dataset....." - [1] "Begining Training Neural Networks" + . [1] "Begining Training Neural Networks" - [1] "the length of weights 517" + . [1] "the length of weights 517" + . [1] "The time taken for training: 330.257000" [1] "The training error cost: 0.001380" - [1] "The time taken for training: 330.257000" - [1] "The training error cost: 0.001380" - [1] "The validation error cost: 0.025099" + . [1] "The validation error cost: 0.025099" - [1] "The testing error cost: 0.020883" + . [1] "The testing error cost: 0.020883" - [1] "Training Accuracy: 100.000000" + . [1] "Training Accuracy: 100.000000" - [1] "Validation Accuracy: 99.650000" + . [1] "Validation Accuracy: 99.650000" - [1] "Testing Accuracy: 99.762349" + . [1] "Testing Accuracy: 99.762349" ''''Import the model into Tika '''' @@ -188, +187 @@ Unit test (tika\tika-core\src\test\java) - org.apache.tika.detect. MimeDetectionWithNNTest + . org.apache.tika.detect. MimeDetectionWithNNTest
