aaronmarkham commented on a change in pull request #12606: Refine the documentation of im2rec URL: https://github.com/apache/incubator-mxnet/pull/12606#discussion_r219236514
########## File path: docs/faq/recordio.md ########## @@ -6,35 +6,39 @@ RecordIO implements a file format for a sequence of records. We recommend storin * Packing data together allows continuous reading on the disk. * RecordIO has a simple way to partition, simplifying distributed setting. We provide an example later. -We provide the [im2rec tool](https://github.com/dmlc/mxnet/blob/master/tools/im2rec.cc) so you can create an Image RecordIO dataset by yourself. The following walkthrough shows you how. +We provide the [im2rec tool](https://github.com/dmlc/mxnet/blob/master/tools/im2rec.cc) so you can create an Image RecordIO dataset by yourself. The following walkthrough shows you how. Note that there is python version of [im2rec tool](https://github.com/apache/incubator-mxnet/blob/master/tools/im2rec.py) and [example](https://mxnet.incubator.apache.org/tutorials/basic/data.html) using real-world data. ### Prerequisites + Download the data. You don't need to resize the images manually. You can use ```im2rec``` to resize them automatically. For details, see the "Extension: Using Multiple Labels for a Single Image," later in this topic. ### Step 1. Make an Image List File + +* Note that the im2rec.py provide a param `--list` to generate the list for you but im2rec.cc doesn't support it. Review comment: you, but ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] With regards, Apache Git Services
