mastodon.world is one of the many independent Mastodon servers you can use to participate in the fediverse.
Generic Mastodon server for anyone to use.

Server stats:

8.4K
active users

#quarto

4 posts4 participants1 post today

This week, I learned how to create and explore a data lake with duckdb, using its new ducklake extension. It was surprisingly easy to hand over the creation and management of parquet files with larg(ish) tables to ducklake. I loved being able to explore the data using R, python or plain SQL - even within the same Quarto document! tomsing1.github.io/blog/posts/ #RStats #python #duckdb #ducklake #quarto

tomsing1.github.ioDipping my toes into the ducklake: Exploring gene expression data with R and python – Thomas Sandmann’s blog
Continued thread

Bernard Angele is back at the lectern #WoReLa1 with their own talk this time on "Low sampling rate is not an obstacle to making reading research more accessible". The #Quarto slides* are available here bangele.quarto.pub/worela2025/ and they include rather hilarious distorted maps of the world illustrating the geographical distribution of #eyetracking studies on reading. Less hilariously, it turns out that, up until recently, the vast majority of eyetracking studies were carried out in the US and West Europe on major European languages. Now studies on Chinese in China are changing the picture, but still eyetracking reading studies are still limited to very few languages.

* Also, it turns out that using #QuartoPub to host Quarto slides is not only useful to share slides with the audience and on social media, but also to easily switch computer when the presenter's laptop can no longer connect with the temperamental projector.

bangele.quarto.pubLow sampling rate is not an obstacle to making reading research more accessible

Some work on my solar-powered #quarto website project - now with a small data transmission tracker 🤩.

The "No JavaScript" is not true anymore though, but the widget is quite lightweight, as far as I can tell.

(gotta say an LLM was heavily involved in creating this one, given my extremely limited javascript skills).

Here is a #quarto #rstats trick that could be useful and I just figured out! If you want to parametrise the creation of pages within a website, instead of using a purrr loop with quarto_render on a .qmd temlpate, use a purrr loop with writeLines to create your .qmd files in a script loaded before rendering with the pre-render yml option!

A blog post is coming soon but you can already have a look at my repository for example: github.com/damien-dupre/cere20

Contribute to damien-dupre/cere2025 development by creating an account on GitHub.
GitHubGitHub - damien-dupre/cere2025Contribute to damien-dupre/cere2025 development by creating an account on GitHub.

I think I need to apologise to my supervisor.

They said, on a recent report we were sent, "It doesn't even have captions on the figures or the tables." I replied, "it is clearly done in RMarkdown or Quarto and no one knows how to do captions on those."

Of course this bugged me so I first looked up how to do them on ggplot2. Oh, you just add it in the labs. That seems amazingly simple. I mean yes they were lattice plots, but there must be a similar flag.

A few hours later, not happy with the answer, I looked up how to get cross referenced figure captions in a code block in Quarto. Turns out it is

```
#| label: fig-name
#| fig-cap: "Your caption here"
```

in the code block. The same is true of tables (you just replace fig with tbl). It even puts the caption at the top for tables. Geez, that is amazingly simple again. (All of this learnt from quarto.org/docs/authoring/figu ) Again, yes this is Quarto and they probably are using RMarkdown but it has to be fairly similar.

So, I now agree that there should be captions on those figures. Yes, its an internal document and no doubt automated but still that is no worse than LaTeX or HTML.

It does mean that now I have to actually caption my figures and tables next time too.

Two plots arranged side-by-side with a large plot underneath it. The top two plots are scatter plots visualizing the `cars` and `pressure` datasets. These two plots are separated by some additional white space. The plot on the bottom visualizes the `mtcars` dataset and is wider and taller than the other two plots combined. This plot is an 11 by 11 grid plotting each of the 11 variables in the `mtcars` dataset against each other as a scatterplot. Instead of scatter plots in the diagonal starting in the upper left and going to the lower right are text labels for each of the variable names. These are: 'mpg', 'cyl', 'disp', 'hp', 'drat', 'wt', 'qsec', 'vs', 'am', 'gear', and 'carb'.
QuartoFigures – Quarto
#rstats#quarto#rtfm

#quarto #rstats friends who use github action to publish articles:

it's currently taking github actions ~30 mins to publish my little #mgcv help site (calgary.converged.yt/). This seems to be because it's installing a lot of R packages from source.

What's the current state-of-the-art to get these things to render quickly? (And using minimal power.)

(I'd like to not use github but I would also like to encourage PRs etc from folks without a huge overhead from them, so let's stick to github-based solutions for now.)

calgary.converged.ytYes! You can do that in mgcv – Yes! You can do that in mgcv!