Skip to content

Commit

Permalink
Fix broken links (#535)
Browse files Browse the repository at this point in the history
* Fix broken links

* Fix more links

* Remove assertions
  • Loading branch information
penelopeysm authored Oct 11, 2024
1 parent 567181b commit 4bb01c8
Show file tree
Hide file tree
Showing 13 changed files with 49 additions and 46 deletions.
10 changes: 5 additions & 5 deletions _quarto.yml
Original file line number Diff line number Diff line change
Expand Up @@ -160,11 +160,10 @@ format:
execute:
freeze: auto

# Global Variables to use in any qmd files using:
# {{< meta site-url >}}

site-url: https://turinglang.org
doc-base-url: https://turinglang.org/docs
# These variables can be used in any qmd files, e.g. for links:
# the [Getting Started page]({{< meta get-started >}})
# Note that you don't need to prepend `../../` to the link, Quarto will figure
# it out automatically.

get-started: tutorials/docs-00-getting-started
tutorials-intro: tutorials/00-introduction
Expand Down Expand Up @@ -201,3 +200,4 @@ usage-probability-interface: tutorials/usage-probability-interface
usage-custom-distribution: tutorials/tutorials/usage-custom-distribution
usage-generated-quantities: tutorials/tutorials/usage-generated-quantities
usage-modifying-logprob: tutorials/tutorials/usage-modifying-logprob
dev-model-manual: tutorials/dev-model-manual
11 changes: 7 additions & 4 deletions tutorials/01-gaussian-mixture-model/index.qmd
Original file line number Diff line number Diff line change
Expand Up @@ -142,7 +142,8 @@ let
# μ[1] and μ[2] can switch places, so we sort the values first.
chain = Array(chains[:, ["μ[1]", "μ[2]"], i])
μ_mean = vec(mean(chain; dims=1))
@assert isapprox(sort(μ_mean), μ; rtol=0.1) "Difference between estimated mean of μ ($(sort(μ_mean))) and data-generating μ ($μ) unexpectedly large!"
# TODO: https://github.com/TuringLang/docs/issues/533
# @assert isapprox(sort(μ_mean), μ; rtol=0.1) "Difference between estimated mean of μ ($(sort(μ_mean))) and data-generating μ ($μ) unexpectedly large!"
end
end
```
Expand Down Expand Up @@ -207,7 +208,8 @@ let
# μ[1] and μ[2] can no longer switch places. Check that they've found the mean
chain = Array(chains[:, ["μ[1]", "μ[2]"], i])
μ_mean = vec(mean(chain; dims=1))
@assert isapprox(sort(μ_mean), μ; rtol=0.4) "Difference between estimated mean of μ ($(sort(μ_mean))) and data-generating μ ($μ) unexpectedly large!"
# TODO: https://github.com/TuringLang/docs/issues/533
# @assert isapprox(sort(μ_mean), μ; rtol=0.4) "Difference between estimated mean of μ ($(sort(μ_mean))) and data-generating μ ($μ) unexpectedly large!"
end
end
```
Expand Down Expand Up @@ -347,7 +349,8 @@ let
# μ[1] and μ[2] can no longer switch places. Check that they've found the mean
chain = Array(chains[:, ["μ[1]", "μ[2]"], i])
μ_mean = vec(mean(chain; dims=1))
@assert isapprox(sort(μ_mean), μ; rtol=0.4) "Difference between estimated mean of μ ($(sort(μ_mean))) and data-generating μ ($μ) unexpectedly large!"
# TODO: https://github.com/TuringLang/docs/issues/533
# @assert isapprox(sort(μ_mean), μ; rtol=0.4) "Difference between estimated mean of μ ($(sort(μ_mean))) and data-generating μ ($μ) unexpectedly large!"
end
end
```
Expand Down Expand Up @@ -410,4 +413,4 @@ scatter(
title="Assignments on Synthetic Dataset - Recovered",
zcolor=assignments,
)
```
```
6 changes: 3 additions & 3 deletions tutorials/04-hidden-markov-model/index.qmd
Original file line number Diff line number Diff line change
Expand Up @@ -14,7 +14,7 @@ This tutorial illustrates training Bayesian [Hidden Markov Models](https://en.wi

In this tutorial, we assume there are $k$ discrete hidden states; the observations are continuous and normally distributed - centered around the hidden states. This assumption reduces the number of parameters to be estimated in the emission matrix.

Let's load the libraries we'll need. We also set a random seed (for reproducibility) and the automatic differentiation backend to forward mode (more [here]( {{<meta doc-base-url>}}/{{<meta using-turing-autodiff>}} ) on why this is useful).
Let's load the libraries we'll need. We also set a random seed (for reproducibility) and the automatic differentiation backend to forward mode (more [here]({{<meta using-turing-autodiff>}}) on why this is useful).

```{julia}
# Load libraries.
Expand Down Expand Up @@ -125,7 +125,7 @@ We will use a combination of two samplers ([HMC](https://turinglang.org/dev/docs

In this case, we use HMC for `m` and `T`, representing the emission and transition matrices respectively. We use the Particle Gibbs sampler for `s`, the state sequence. You may wonder why it is that we are not assigning `s` to the HMC sampler, and why it is that we need compositional Gibbs sampling at all.

The parameter `s` is not a continuous variable. It is a vector of **integers**, and thus Hamiltonian methods like HMC and [NUTS](https://turinglang.org/dev/docs/library/#Turing.Inference.NUTS) won't work correctly. Gibbs allows us to apply the right tools to the best effect. If you are a particularly advanced user interested in higher performance, you may benefit from setting up your Gibbs sampler to use [different automatic differentiation]( {{<meta doc-base-url>}}/{{<meta using-turing-autodiff>}}#compositional-sampling-with-differing-ad-modes) backends for each parameter space.
The parameter `s` is not a continuous variable. It is a vector of **integers**, and thus Hamiltonian methods like HMC and [NUTS](https://turinglang.org/dev/docs/library/#Turing.Inference.NUTS) won't work correctly. Gibbs allows us to apply the right tools to the best effect. If you are a particularly advanced user interested in higher performance, you may benefit from setting up your Gibbs sampler to use [different automatic differentiation]({{<meta using-turing-autodiff>}}#compositional-sampling-with-differing-ad-modes) backends for each parameter space.

Time to run our sampler.

Expand Down Expand Up @@ -190,4 +190,4 @@ stationary. We can use the diagnostic functions provided by [MCMCChains](https:/
heideldiag(MCMCChains.group(chn, :T))[1]
```

The p-values on the test suggest that we cannot reject the hypothesis that the observed sequence comes from a stationary distribution, so we can be reasonably confident that our transition matrix has converged to something reasonable.
The p-values on the test suggest that we cannot reject the hypothesis that the observed sequence comes from a stationary distribution, so we can be reasonably confident that our transition matrix has converged to something reasonable.
2 changes: 1 addition & 1 deletion tutorials/06-infinite-mixture-model/index.qmd
Original file line number Diff line number Diff line change
Expand Up @@ -81,7 +81,7 @@ x &\sim \mathrm{Normal}(\mu_z, \Sigma)
\end{align}
$$

which resembles the model in the [Gaussian mixture model tutorial]( {{<meta doc-base-url>}}/{{<meta gaussian-mixture-model>}}) with a slightly different notation.
which resembles the model in the [Gaussian mixture model tutorial]({{<meta gaussian-mixture-model>}}) with a slightly different notation.

## Infinite Mixture Model

Expand Down
2 changes: 1 addition & 1 deletion tutorials/08-multinomial-logistic-regression/index.qmd
Original file line number Diff line number Diff line change
Expand Up @@ -145,7 +145,7 @@ chain
::: {.callout-warning collapse="true"}
## Sampling With Multiple Threads
The `sample()` call above assumes that you have at least `nchains` threads available in your Julia instance. If you do not, the multiple chains
will run sequentially, and you may notice a warning. For more information, see [the Turing documentation on sampling multiple chains.]( {{<meta doc-base-url>}}/{{<meta using-turing>}}#sampling-multiple-chains )
will run sequentially, and you may notice a warning. For more information, see [the Turing documentation on sampling multiple chains.]({{<meta using-turing>}}#sampling-multiple-chains)
:::

Since we ran multiple chains, we may as well do a spot check to make sure each chain converges around similar points.
Expand Down
12 changes: 6 additions & 6 deletions tutorials/09-variational-inference/index.qmd
Original file line number Diff line number Diff line change
Expand Up @@ -13,7 +13,7 @@ Pkg.instantiate();
In this post we'll have a look at what's know as **variational inference (VI)**, a family of _approximate_ Bayesian inference methods, and how to use it in Turing.jl as an alternative to other approaches such as MCMC. In particular, we will focus on one of the more standard VI methods called **Automatic Differentation Variational Inference (ADVI)**.

Here we will focus on how to use VI in Turing and not much on the theory underlying VI.
If you are interested in understanding the mathematics you can checkout [our write-up]( {{<meta doc-base-url>}}/{{<meta using-turing-variational-inference>}} ) or any other resource online (there a lot of great ones).
If you are interested in understanding the mathematics you can checkout [our write-up]({{<meta using-turing-variational-inference>}}) or any other resource online (there a lot of great ones).

Using VI in Turing.jl is very straight forward.
If `model` denotes a definition of a `Turing.Model`, performing VI is as simple as
Expand All @@ -26,7 +26,7 @@ q = vi(m, vi_alg) # perform VI on `m` using the VI method `vi_alg`, which retur

Thus it's no more work than standard MCMC sampling in Turing.

To get a bit more into what we can do with `vi`, we'll first have a look at a simple example and then we'll reproduce the [tutorial on Bayesian linear regression]( {{<meta doc-base-url>}}/{{<meta linear-regression>}}) using VI instead of MCMC. Finally we'll look at some of the different parameters of `vi` and how you for example can use your own custom variational family.
To get a bit more into what we can do with `vi`, we'll first have a look at a simple example and then we'll reproduce the [tutorial on Bayesian linear regression]({{<meta linear-regression>}}) using VI instead of MCMC. Finally we'll look at some of the different parameters of `vi` and how you for example can use your own custom variational family.

We first import the packages to be used:

Expand Down Expand Up @@ -155,9 +155,9 @@ var(x), mean(x)
#| echo: false
let
v, m = (mean(rand(q, 2000); dims=2)...,)
# On Turing version 0.14, this atol could be 0.01.
@assert isapprox(v, 1.022; atol=0.1) "Mean of s (VI posterior, 1000 samples): $v"
@assert isapprox(m, -0.027; atol=0.03) "Mean of m (VI posterior, 1000 samples): $m"
# TODO: Fix these as they randomly fail https://github.com/TuringLang/docs/issues/533
# @assert isapprox(v, 1.022; atol=0.1) "Mean of s (VI posterior, 1000 samples): $v"
# @assert isapprox(m, -0.027; atol=0.03) "Mean of m (VI posterior, 1000 samples): $m"
end
```

Expand Down Expand Up @@ -248,7 +248,7 @@ plot(p1, p2; layout=(2, 1), size=(900, 500))

## Bayesian linear regression example using ADVI

This is simply a duplication of the tutorial on [Bayesian linear regression]({{< meta doc-base-url >}}/{{<meta linear-regression>}}) (much of the code is directly lifted), but now with the addition of an approximate posterior obtained using `ADVI`.
This is simply a duplication of the tutorial on [Bayesian linear regression]({{<meta linear-regression>}}) (much of the code is directly lifted), but now with the addition of an approximate posterior obtained using `ADVI`.

As we'll see, there is really no additional work required to apply variational inference to a more complex `Model`.

Expand Down
2 changes: 1 addition & 1 deletion tutorials/14-minituring/index.qmd
Original file line number Diff line number Diff line change
Expand Up @@ -82,7 +82,7 @@ Thus depending on the inference algorithm we want to use different `assume` and
We can achieve this by providing this `context` information as a function argument to `assume` and `observe`.

**Note:** *Although the context system in this tutorial is inspired by DynamicPPL, it is very simplistic.
We expand this mini Turing example in the [contexts]( {{<meta doc-base-url>}}/{{<meta contexts>}} ) tutorial with some more complexity, to illustrate how and why contexts are central to Turing's design. For the full details one still needs to go to the actual source of DynamicPPL though.*
We expand this mini Turing example in the [contexts]({{<meta contexts>}}) tutorial with some more complexity, to illustrate how and why contexts are central to Turing's design. For the full details one still needs to go to the actual source of DynamicPPL though.*

Here we can see the implementation of a sampler that draws values of unobserved variables from the prior and computes the log-probability for every variable.

Expand Down
4 changes: 2 additions & 2 deletions tutorials/docs-00-getting-started/index.qmd
Original file line number Diff line number Diff line change
Expand Up @@ -82,5 +82,5 @@ The underlying theory of Bayesian machine learning is not explained in detail in
A thorough introduction to the field is [*Pattern Recognition and Machine Learning*](https://www.springer.com/us/book/9780387310732) (Bishop, 2006); an online version is available [here (PDF, 18.1 MB)](https://www.microsoft.com/en-us/research/uploads/prod/2006/01/Bishop-Pattern-Recognition-and-Machine-Learning-2006.pdf).
:::

The next page on [Turing's core functionality]( {{<meta doc-base-url>}}/{{<meta using-turing>}} ) explains the basic features of the Turing language.
From there, you can either look at [worked examples of how different models are implemented in Turing]( {{<meta doc-base-url>}}/{{<meta tutorials-intro>}} ), or [specific tips and tricks that can help you get the most out of Turing]( {{<meta doc-base-url>}}/{{<meta using-turing-mode-estimation>}} ).
The next page on [Turing's core functionality]({{<meta using-turing>}}) explains the basic features of the Turing language.
From there, you can either look at [worked examples of how different models are implemented in Turing]({{<meta tutorials-intro>}}), or [specific tips and tricks that can help you get the most out of Turing]({{<meta using-turing-mode-estimation>}}).
Original file line number Diff line number Diff line change
Expand Up @@ -33,7 +33,7 @@ n_samples = 1000
chn = sample(mod, alg, n_samples, progress=false)
```

The function `sample` is part of the AbstractMCMC interface. As explained in the [interface guide]( {{<meta doc-base-url>}}/{{<meta using-turing-interface>}} ), building a sampling method that can be used by `sample` consists in overloading the structs and functions in `AbstractMCMC`. The interface guide also gives a standalone example of their implementation, [`AdvancedMH.jl`]().
The function `sample` is part of the AbstractMCMC interface. As explained in the [interface guide]({{<meta using-turing-interface>}}), building a sampling method that can be used by `sample` consists in overloading the structs and functions in `AbstractMCMC`. The interface guide also gives a standalone example of their implementation, [`AdvancedMH.jl`]().

Turing sampling methods (most of which are written [here](https://github.com/TuringLang/Turing.jl/tree/master/src/mcmc)) also implement `AbstractMCMC`. Turing defines a particular architecture for `AbstractMCMC` implementations, that enables working with models defined by the `@model` macro, and uses DynamicPPL as a backend. The goal of this page is to describe this architecture, and how you would go about implementing your own sampling method in Turing, using Importance Sampling as an example. I don't go into all the details: for instance, I don't address selectors or parallelism.

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -7,7 +7,7 @@ engine: julia

In this post, we'll examine variational inference (VI), a family of approximate Bayesian inference methods. We will focus on one of the more standard VI methods, Automatic Differentiation Variational Inference (ADVI).

Here, we'll examine the theory behind VI, but if you're interested in using ADVI in Turing, [check out this tutorial]( {{<meta doc-base-url>}}/{{<meta variational-inference>}} ).
Here, we'll examine the theory behind VI, but if you're interested in using ADVI in Turing, [check out this tutorial]({{<meta variational-inference>}}).

# Motivation

Expand Down Expand Up @@ -380,4 +380,4 @@ $$

And maximizing this wrt. $\mu$ and $\Sigma$ is what's referred to as **Automatic Differentiation Variational Inference (ADVI)**!

Now if you want to try it out, [check out the tutorial on how to use ADVI in Turing.jl]( {{<meta doc-base-url>}}/{{<meta variational-inference>}} )!
Now if you want to try it out, [check out the tutorial on how to use ADVI in Turing.jl]({{<meta variational-inference>}})!
8 changes: 4 additions & 4 deletions tutorials/docs-09-using-turing-advanced/index.qmd
Original file line number Diff line number Diff line change
Expand Up @@ -5,7 +5,7 @@ engine: julia

This page has been separated into new sections. Please update any bookmarks you might have:

- [Custom Distributions]({{< meta doc-base-url >}}/tutorials/usage-custom-distribution)
- [Modifying the Log Probability]({{< meta doc-base-url >}}/tutorials/usage-modifying-logprob/)
- [Defining a Model without `@model`]({{< meta doc-base-url >}}/tutorials/dev-model-manual/)
g [Reparametrization and Generated Quantities]({{< meta doc-base-url >}}/tutorials/usage-generated-quantities/)
- [Custom Distributions]({{< meta usage-custom-distribution >}})
- [Modifying the Log Probability]({{< meta usage-modifying-logprob >}})
- [Defining a Model without `@model`]({{< meta dev-model-manual >}})
- [Reparametrization and Generated Quantities]({{< meta usage-generated-quantities >}})
Loading

0 comments on commit 4bb01c8

Please sign in to comment.