@@ -44,7 +44,7 @@ def summarise_battery(battery):
4444(sec: suite-v1 )=
4545# Benchmark Suite (v` r VERSION ` )
4646
47- We have compiled ** a large suite of benchmark datasets** .
47+ We have compiled, curated, and polished ** a large suite of benchmark datasets** .
4848For reproducibility, the datasets and label vectors are ** versioned** .
4949
5050
@@ -77,7 +77,7 @@ index *g*, where $g=0$ means that all clusters consist of the same number
7777of points.
7878
7979
80- ::::{important }
80+ ::::{note }
8181The versioned ** snapshots of the suite** are available for download at:
8282<https://github.com/gagolews/clustering-data-v1/releases/tag/v`r VERSION`>.
8383
@@ -107,9 +107,11 @@ each dataset is accompanied by a text file specifying more details thereon
107107(e.g., the literature references that we are asked to cite).
108108
109109
110- As a courtesy, ** please cite** also the current project
110+ ::::{important}
111+ As a courtesy, ** please cite** the original source as well as the current project
111112{cite}` clustering_benchmarks ` as well as mention {cite}` clustering_data_v1 `
112113which gives the exact version and URL of the dataset suite. Thank you.
114+ ::::
113115
114116
115117There is some inherent overlap between the original databases.
@@ -189,7 +191,7 @@ summarise_battery("sipu")
189191(sec: battery-fcps )=
190192## ` fcps `
191193
192- 9 datasets from the * Fundamental Clustering Problem Suite*
194+ Nine datasets from the * Fundamental Clustering Problem Suite*
193195proposed by A. Ultsch {cite}` fcps ` from the Marburg University,
194196Germany.
195197
@@ -214,7 +216,7 @@ summarise_battery("fcps")
214216(sec: battery-graves )=
215217## ` graves `
216218
217- 10 * synthetic data sets* discussed by D. Graves and W. Pedrycz
219+ Ten * synthetic data sets* discussed by D. Graves and W. Pedrycz
218220in {cite}` graves ` .
219221
220222The dataset consist of 200–1050 observations in 2 dimensions.
@@ -272,7 +274,7 @@ summarise_battery("other")
272274(sec: battery-uci )=
273275## ` uci `
274276
275- A selection of 8 high-dimensional datasets available through the UCI
277+ A selection of eight high-dimensional datasets available through the UCI
276278(University of California, Irvine)
277279[ Machine Learning Repository] ( http://archive.ics.uci.edu/ml/ ) {cite}` uci ` .
278280Some of them were considered for benchmark purposes
0 commit comments