-
Notifications
You must be signed in to change notification settings - Fork 6.8k
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. Thank you for the fix!
@zachgk @lanking520 could you help review the license? Thanks! |
LICENSE
Outdated
@@ -276,6 +276,7 @@ | |||
Copyright (c) 2015 by Contributors | |||
Copyright 1984, 1987, 1992 by Stephen L. Moshier | |||
|
|||
27. CNN Text Classification Example - For details, see example/cnn_text_classification/data_helpers.py |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The LICENSE file is more specifically our source release license. It should only refer to things which are bundled as part of the source release (http://www.apache.org/dev/licensing-howto.html). Maybe we could move this to a separate DATASET_LICENSE file?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm changing the dataset license link as a comment of the code that downloads it. As these dataset are not bit included in our distribution, and should not be included in top level LICENSE file.
LICENSE
Outdated
@@ -349,6 +350,19 @@ | |||
Copyright 2012 Continuum Analytics, Inc. | |||
|
|||
|
|||
======================================================================================= | |||
Creative Commons Attribution 4.0 International (CC BY 4.0) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Note that there are 6 different licenses as part of CC BY 4.0 (https://creativecommons.org/licenses/). It is important to know which one because some of them will prevent commercial usage, prevent derivative works, or require others to use the same license.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
provided link to exact license statement at the place that downloads it.
@@ -101,7 +101,19 @@ def download_imdb(data_dir='/tmp/data'): | |||
''' | |||
Download and extract the IMDB dataset | |||
''' | |||
url = ('http://ai.stanford.edu/~amaas/data/sentiment/aclImdb_v1.tar.gz') | |||
# dataset from http://ai.stanford.edu/~amaas/data/sentiment/ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
While it is good to include the full citation here, also add the information on licensing and copyrights to the README or whatever docs people read which tells them to download the data. The idea is that some of these licenses actually have consequences. For example, we don't want to let commercial users accidentally work with a non-commercial dataset. So, our goal is to make sure that any time we inform users about a dataset, we also explain what legal requirements come with that dataset as well.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There is not license or copy right for this dataset, only citation requriement.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just say the copyright is whatever person or group seems to have created it. And, say that they require attribution in place of the usual license name (but link to the request)
Thanks for your contributions @roywei . |
@@ -26,6 +26,8 @@ def unzip_file(filename, outpath): | |||
z.extract(name, outpath) | |||
fh.close() | |||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We need to also put the information below in the corresponding documents. In this case, it would be example/gluon/style_transfer/README.md
.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
done
@@ -101,7 +101,19 @@ def download_imdb(data_dir='/tmp/data'): | |||
''' | |||
Download and extract the IMDB dataset | |||
''' | |||
url = ('http://ai.stanford.edu/~amaas/data/sentiment/aclImdb_v1.tar.gz') | |||
# dataset from http://ai.stanford.edu/~amaas/data/sentiment/ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just say the copyright is whatever person or group seems to have created it. And, say that they require attribution in place of the usual license name (but link to the request)
Co-Authored-By: Zach Kimberg <[email protected]>
Co-Authored-By: Zach Kimberg <[email protected]>
@zachgk I have added both at source code and README. Also added copyright |
* update license * update license * fix typo * update license * add comment * Update example/gluon/style_transfer/dataset/download_dataset.py Co-Authored-By: Zach Kimberg <[email protected]> * Update example/gluon/embedding_learning/get_cub200_data.sh Co-Authored-By: Zach Kimberg <[email protected]> * update license * add license * trigger ci * fix large tensor * update copy right * fix wrong commit * fix * trigger
* update license * update license * fix typo * update license * add comment * Update example/gluon/style_transfer/dataset/download_dataset.py Co-Authored-By: Zach Kimberg <[email protected]> * Update example/gluon/embedding_learning/get_cub200_data.sh Co-Authored-By: Zach Kimberg <[email protected]> * update license * add license * trigger ci * fix large tensor * update copy right * fix wrong commit * fix * trigger
* update license * update license * fix typo * update license * add comment * Update example/gluon/style_transfer/dataset/download_dataset.py Co-Authored-By: Zach Kimberg <[email protected]> * Update example/gluon/embedding_learning/get_cub200_data.sh Co-Authored-By: Zach Kimberg <[email protected]> * update license * add license * trigger ci * fix large tensor * update copy right * fix wrong commit * fix * trigger
Description
As suggested in 1.4.1 release dev list discussion, replacing MNIST url to canonical URL. https://lists.apache.org/thread.html/0cb2131f2506661a884f89d8419aba08298cbc50aaeeda06e41e530f@%3Cdev.mxnet.apache.org%3E
and update the license for datasets used in examples.
Checklist
Essentials
Please feel free to remove inapplicable items for your PR.
Changes
Comments