Hi there, I'm having some trouble interpreting the shape of the data for
the Labeled Faces in the Wild (LFW) pairs dataset as returned by the scikit
dataset method fetch_lfw_pairs.
If I look at the training pairs data shape, lfw_pairs_train.pairs.shape, it
comes back as (2200, 2, 62, 47). I understand that 2200 is the # of
training samples, while 2 is each pair of the images whether they are the
same or not. Why are the final parts of the array 62 and 47 when the size
of the images are 62x67? Seems like it should be 62 and 67 for the final
shape of that array, not 47. What am I missing? :)
Also, what is the difference between the 'pairs' and 'data' members? Seems
like they both essentially have the same data, except one is unrolled?
Finally, I see that the target values are 1 and 0. I'm assuming that 1
means the two pairs are the same while 0 means they are not?
Best,
Brad Neuberg
------------------------------------------------------------------------------
Dive into the World of Parallel Programming The Go Parallel Website, sponsored
by Intel and developed in partnership with Slashdot Media, is your hub for all
things parallel software development, from weekly thought leadership blogs to
news, videos, case studies, tutorials and more. Take a look and join the
conversation now. http://goparallel.sourceforge.net/
_______________________________________________
Scikit-learn-general mailing list
Scikit-learn-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general