Hi Michael,

I think I found your post on SO: https://stackoverflow.com/questions/45041677/java-spark-training-on-new-data-with-datasetrow-from-csv-file
The exception trace there is quite different from what I read here, and indeed is self-explanatory:

...
Caused by: java.lang.IllegalArgumentException: requirement failed: The columns of A don't match the number of elements of x. A: 38611, x: 36179
...

Can it be that you have different 'features' vector sizes from train and test?

Best,

On Wed, Jul 12, 2017 at 1:41 PM, Kunkel, Michael C. <m.kun...@fz-juelich.de> wrote:

> Greetings
>
> The attachment I meant to refer to was the posting in the initial email on
> the email list.
>
> BR
> MK
> ----------------------------------------
> Michael C. Kunkel, USMC, PhD
> Forschungszentrum Jülich
> Nuclear Physics Institute and Juelich Center for Hadron Physics
> Experimental Hadron Structure (IKP-1)
> www.fz-juelich.de/ikp
>
> On Jul 12, 2017, at 09:56, Riccardo Ferrari <ferra...@gmail.com> wrote:
>
> Hi Michael,
>
> I don't see any attachment, not sure you can attach files though
>
> On Tue, Jul 11, 2017 at 10:44 PM, Michael C. Kunkel <m.kun...@fz-juelich.de> wrote:
>
>> Greetings,
>>
>> Thanks for the communication.
>>
>> I attached the entire stacktrace in which was output to the screen.
>> I tried to use JavaRDD and LabeledPoint then convert to Dataset and I
>> still get the same error as I did when I only used datasets.
>>
>> I am using the expected ml Vector. I tried it using the mllib and that
>> also didnt work.
>>
>> BR
>> MK
>> ----------------------------------------
>> Michael C. Kunkel, USMC, PhD
>> Forschungszentrum Jülich
>> Nuclear Physics Institute and Juelich Center for Hadron Physics
>> Experimental Hadron Structure (IKP-1)
>> www.fz-juelich.de/ikp
>>
>> On 11/07/2017 17:21, Riccardo Ferrari wrote:
>>
>> Mh, to me feels like there some data mismatch. Are you sure you're using
>> the expected Vector (ml vs mllib). I am not sure you attached the whole
>> Exception but you might find some more useful details there.
>>
>> Best,
>>
>> On Tue, Jul 11, 2017 at 3:07 PM, mckunkel <m.kun...@fz-juelich.de> wrote:
>>
>>> Im not sure why I cannot subscribe, so that everyone can view the
>>> conversation.
>>> Help?
>>>
>>> --
>>> View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Testing-another-Dataset-after-ML-training-tp28845p28846.html
>>> Sent from the Apache Spark User List mailing list archive at Nabble.com.
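
PS: to illustrate the size rule behind the "columns of A don't match the number of elements of x" message quoted at the top, here is a minimal sketch in plain Java. It does not use Spark and is not Spark's own code. The class name FeatureSizeCheck and the method multiply are made up for this example. It only shows that a matrix with 38611 columns (the width the model saw during training, if my guess is right) cannot be multiplied by a vector with 36179 elements.

```java
final class FeatureSizeCheck {

    /**
     * Multiplies a row-major matrix by a vector.
     * Hypothetical helper: it imitates the size requirement from the
     * exception message, it is not how Spark implements the check.
     */
    static double[] multiply(double[][] a, double[] x) {
        for (double[] row : a) {
            if (row.length != x.length) {
                throw new IllegalArgumentException(
                    "requirement failed: The columns of A don't match the number of elements of x. A: "
                        + row.length + ", x: " + x.length);
            }
        }
        double[] y = new double[a.length];
        for (int i = 0; i < a.length; i++) {
            for (int j = 0; j < x.length; j++) {
                y[i] += a[i][j] * x[j];
            }
        }
        return y;
    }

    public static void main(String[] args) {
        // One row of coefficients, as wide as the training feature vectors.
        double[][] coefficients = new double[1][38611];
        // A test feature vector built with a different width.
        double[] testFeatures = new double[36179];
        try {
            multiply(coefficients, testFeatures);
        } catch (IllegalArgumentException e) {
            System.out.println(e.getMessage());
        }
    }
}
```

If this is what happens in your job, the usual remedy is to build the 'features' column for the test data with the same fitted feature transformers that were used for the training data, so both sides end up with the same vector size. That is an assumption on my side, since I have not seen how your features are assembled.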