So the XML as CSV would be: item_id,similar_items,cross_action_similar_items ipad,iphone,iphone nexus iphone,ipad,ipad galaxy
Note: As I mentioned before the order of the items in the field will encode rank of the similarity strength. This is for cases where you want to find similar items to a context item. You would fetch the doc for the context item by it's item ID and show the top k items in the doc. Ted's caveat would probably be to dither them. Sounds like Ted is generating data. Andrew or M Lyon do either of you want to set the demo system up? If so you'll need to find a system--free tier AWS, Ted's box, etc. Then install all the needed stuff. I'll get the output working to csv. On Jul 31, 2013, at 11:51 AM, Pat Ferrel <[email protected]> wrote: OK and yes. The docs will look like: <add> <doc> <field name='item_id'>ipad</field> <field name='similar_items'>iphone</field> <field name='cross_action_similar_items'>iphone nexus</field> </doc> <doc> <field name='item_id'>iphone</field> <field name='similar_items'>ipad</field> <field name='cross_action_similar_items'>ipad galaxy</field> </doc> </add> On Jul 31, 2013, at 11:42 AM, B Lyon <[email protected]> wrote: I'm interested in helping as well. Btw I thought that what was stored in the solr fields were the llr-filtered items (ids I guess) for the could-be-recommended things.
