This is an automated email from the ASF dual-hosted git repository.
janardhan pushed a commit to branch main
in repository https://gitbox.apache.org/repos/asf/systemds-website.git
The following commit(s) were added to refs/heads/main by this push:
new db4d646d [SYSTEMDS-3389] Add clickbait dataset to the website
db4d646d is described below
commit db4d646d096637e9babea197b4bec436f72bb07a
Author: Stefan Schoerkmeier <[email protected]>
AuthorDate: Mon Jun 13 17:56:40 2022 +0200
[SYSTEMDS-3389] Add clickbait dataset to the website
Closes #110.
---
_src/_datasets/clickbait.md | 18 ++++++++++++++++++
_src/assets/datasets/clickbait/clickbait.7z | Bin 0 -> 561715 bytes
2 files changed, 18 insertions(+)
diff --git a/_src/_datasets/clickbait.md b/_src/_datasets/clickbait.md
new file mode 100644
index 00000000..eb0c30d5
--- /dev/null
+++ b/_src/_datasets/clickbait.md
@@ -0,0 +1,18 @@
+---
+layout: datasets
+title: Clickbait Dataset
+description: Dataset for Classification of news headlines into clickbait or
non-clickbait.
+link: clickbait
+---
+
+The data is collected from various news sites.
+The clickbait headlines are collected from sites such as ‘BuzzFeed’,
‘Upworthy’, ‘ViralNova’, ‘Thatscoop’, ‘Scoopwhoop’ and ‘ViralStories’.
+The relevant or non-clickbait headlines are collected from many trustworthy
news sites such as ‘WikiNews’, ’New York Times’, ‘The Guardian’, and ‘The
Hindu’.
+
+Home Page:
[https://www.kaggle.com/datasets/amananandrai/clickbait-dataset](https://www.kaggle.com/datasets/amananandrai/clickbait-dataset)
+
+Download Size: `1.84 MiB`
+
+### Compressed Files
+
+1. [`clickbait.7z`: whole data used for the LSTM
example](https://systemds.apache.org/assets/datasets/clickbait/clickbait.7z)
\ No newline at end of file
diff --git a/_src/assets/datasets/clickbait/clickbait.7z
b/_src/assets/datasets/clickbait/clickbait.7z
new file mode 100644
index 00000000..3c30a1d5
Binary files /dev/null and b/_src/assets/datasets/clickbait/clickbait.7z differ