kirk86 commented on issue #4696: run errors URL: https://github.com/apache/incubator-mxnet/issues/4696#issuecomment-455904378 @hetong007 I haven't installed GluonCV, only `mxnet-cu90`: ``` pip list Package Version ---------- ---------- certifi 2018.11.29 chardet 3.0.4 graphviz 0.8.4 idna 2.6 mxnet-cu90 1.3.1 numpy 1.14.6 pip 18.1 requests 2.21.0 setuptools 40.6.3 urllib3 1.22 wheel 0.32.3 ``` This is the command I execute to train resnet on imagenet: ``` python ./incubator-mxnet/example/image-classification/train_imagenet.py --network resnet --num-layers 152 --data-train ./train_rec/train_imgnet_list_1.rec --data-val ./val_rec/val_imgnet_lst.rec --gpus 0,1,2,3 --batch-size 128 --model ./model/resnet152 --num-epochs 100 --kv-store device ``` I didn't open this issue on GluonCV because I'm not sure if it's 100% related with GluonCV. Also please notice the `--data-train` and `--data-valid` have to point to the `.rec` files otherwise it doesn't work which is contrary to example give [here](https://mxnet.incubator.apache.org/versions/master/tutorials/vision/large_scale_classification.html) which points only to the directory where the `*.rec` files live. Also multiple `*.rec` files don't work they have to be combined into one `.rec` file.
---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] With regards, Apache Git Services
