vitskvara/PaperUtils.jl

PaperUtils

Utilities for creating content for LaTeX papers with Julia. Implements utilities for the statistical comparison of classifiers as described in this paper.

Computation of ranks across datasets:

using PaperUtils
using DataFrames

# this contains AUC values for individual models on different datasets
perf_df = DataFrame(
	:dataset =>  ["abalone", "haberman", "iris", "yeast"],
	:knn => [0.85, 0.71, 0.98, 0.78],
	:ocsvm => [0.8, 0.68, 0.93, 0.7],
	:if => [0.7, 0.61, 0.93, 0.65]
	)
4×4 DataFrame
│ Row │ dataset  │ knn     │ ocsvm   │ if      │
│     │ String   │ Float64 │ Float64 │ Float64 │
├─────┼──────────┼─────────┼─────────┼─────────┤
│ 1   │ abalone  │ 0.85    │ 0.8     │ 0.7     │
│ 2   │ haberman │ 0.71    │ 0.68    │ 0.61    │
│ 3   │ iris     │ 0.98    │ 0.93    │ 0.93    │
│ 4   │ yeast    │ 0.78    │ 0.7     │ 0.65    │

# here we get the ranks (ties receive the average rank)
rank_df = PaperUtils.rankdf(perf_df)
5×4 DataFrame
│ Row │ dataset   │ knn     │ ocsvm   │ if      │
│     │ String    │ Float64 │ Float64 │ Float64 │
├─────┼───────────┼─────────┼─────────┼─────────┤
│ 1   │ abalone   │ 1.0     │ 2.0     │ 3.0     │
│ 2   │ haberman  │ 1.0     │ 2.0     │ 3.0     │
│ 3   │ iris      │ 1.0     │ 2.5     │ 2.5     │
│ 4   │ yeast     │ 1.0     │ 2.0     │ 3.0     │
│ 5   │ mean rank │ 1.0     │ 2.125   │ 2.875   │
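`PaperUtils.rankdf` assigns rank 1 to the best model on each dataset and gives tied models the average of their ranks. Its internals are not shown here, but the same per-row ranking can be sketched with `tiedrank` from StatsBase.jl (an assumption: StatsBase is not otherwise required by this example); since higher AUC is better, scores are negated before ranking:

```julia
using StatsBase

# two rows of the AUC table above: (knn, ocsvm, if) on iris and yeast
scores = [0.98 0.93 0.93;
          0.78 0.70 0.65]

# rank each row; ties (ocsvm and if on iris) get the average rank 2.5
ranks = mapslices(row -> tiedrank(-row), scores; dims=2)
# ranks == [1.0 2.5 2.5; 1.0 2.0 3.0]
```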

The Friedman test decides whether the performance of all compared models, as measured by their mean ranks, is the same:

R = Array(rank_df[end, 2:end])
n = 4 # no. datasets
k = 3 # no. models
α = 0.1 
fts = PaperUtils.friedman_test_statistic(R,n,k)
7.125

ftcv = PaperUtils.friedman_critval(α, k)
5.99146454710798
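The value 7.125 is consistent with the textbook Friedman chi-square statistic computed from the mean ranks. A sketch, assuming this is the formula PaperUtils uses internally:

```julia
# Friedman chi-square statistic from mean ranks R_j over n datasets
# and k models: chisq_F = 12n / (k(k+1)) * (sum(R_j^2) - k(k+1)^2 / 4)
R = [1.0, 2.125, 2.875]  # mean ranks of knn, ocsvm, if
n, k = 4, 3              # no. of datasets, no. of models

chisq_F = 12n / (k * (k + 1)) * (sum(R .^ 2) - k * (k + 1)^2 / 4)
# chisq_F == 7.125, matching friedman_test_statistic above
```

The reported critical value 5.99146454710798 appears to equal `quantile(Chisq(k - 1), 1 - α/2)` from Distributions.jl, i.e. a two-sided χ² quantile; this is an observation about the numbers, not documented behaviour.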

Since fts > ftcv, the difference in ranks is statistically significant and we can continue with the paired Nemenyi test, which compares the difference in mean ranks between pairs of models.

ncd = PaperUtils.nemenyi_cd(k, n, α)
1.451190106751196

Since the difference in mean ranks between knn and ocsvm (2.125 − 1.0 = 1.125) is below the critical difference, their performance is, statistically speaking, the same, while the difference between knn and if (2.875 − 1.0 = 1.875) is significant. We can show this in a critical difference diagram, where the mean ranks of the models are drawn and models with statistically indistinguishable performance are connected with a horizontal band.
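The pairwise decision rule can be sketched directly: two models differ significantly exactly when their mean ranks differ by more than the critical difference (the helper `same` below is hypothetical, not part of PaperUtils):

```julia
# mean ranks from the rank table above and the Nemenyi critical difference
mean_ranks = Dict("knn" => 1.0, "ocsvm" => 2.125, "if" => 2.875)
ncd = 1.451190106751196

# two models are statistically indistinguishable when the absolute
# difference of their mean ranks is below the critical difference
same(a, b) = abs(mean_ranks[a] - mean_ranks[b]) < ncd

same("knn", "ocsvm")  # true:  |1.0 - 2.125| = 1.125 < ncd
same("knn", "if")     # false: |1.0 - 2.875| = 1.875 > ncd
same("ocsvm", "if")   # true:  |2.125 - 2.875| = 0.75 < ncd
```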

algnames = ["knn", "ocsvm", "if"]
crit_diag = PaperUtils.ranks2tikzcd(R, algnames, ncd)
print(crit_diag)
\begin{tikzpicture}[scale=1.0] 
  \draw (1.0,0) -- (3.0,0); 
  \foreach \x in {1,...,3} \draw (\x,0.10) -- (\x,-0.10) node[anchor=north]{$\x$}; 
  \draw (1.0,0) -- (1.0,0.19999999999999998) -- (0.9, 0.19999999999999998) node[anchor=east] {knn}; 
  \draw (2.125,0) -- (2.125,0.5) -- (0.9, 0.5) node[anchor=east] {ocsvm}; 
  \draw (2.875,0) -- (2.875,0.2) -- (3.1, 0.2) node[anchor=west] {if}; 
  \draw[line width=0.06cm,color=black,draw opacity=1.0] (0.97,0.1) -- (2.155,0.1); 
  \draw[line width=0.06cm,color=black,draw opacity=1.0] (2.095,0.2) -- (2.905,0.2); 
 \end{tikzpicture} 

# save the string to file
PaperUtils.string2file("tikz/example.tikz", crit_diag)
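The saved file can then be pulled into a paper with `\input`. A minimal LaTeX sketch, assuming the `tikz` package is loaded and the `tikz/` directory sits next to the main `.tex` file:

```latex
\documentclass{article}
\usepackage{tikz}
\begin{document}
\begin{figure}
  \centering
  \input{tikz/example.tikz}
  \caption{Critical difference diagram.}
\end{figure}
\end{document}
```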

Figure: rendered critical difference diagram.
