Mastodon.world admins @mwadmin

**Leon Brocard** @orangeacme@fosstodon.org · Feb 11 *

Leon Brocard @orangeacme@fosstodon.org

What are the longest HTTP header names and values? I dug into the HTTP Archive to find out: https://www.fastly.com/blog/the-lengthiest-http-headers #http #webperf #httpArchive

www.fastly.comThe lengthiest HTTP headers | FastlyDiscover how large HTTP headers can impact your web page's loading speed. Learn about essential headers and strategies to optimize their size for better performance.

**Ranktify** @Ranktify@seocommunity.social · Dec 2, 2024

Dec 2, 2024

Ranktify @Ranktify@seocommunity.social

Congrats to our founder @MichaelLewittes, as well as @dwsmart, @jammer_volts, Mikael Araujo, and @tunetheweb for their work on the SEO chapter of the HTTP Archive's Web Almanac!

You can read it here: https://almanac.httparchive.org/en/2024/seo

#WebAlmanac #HTTPArchive #SEO

**Michael Lewittes** @MichaelLewittes@seocommunity.social · Dec 2, 2024

Dec 2, 2024

Michael Lewittes @MichaelLewittes@seocommunity.social

Was an absolute pleasure writing/editing the SEO chapter of the HTTP Archive's Web Almanac with
@dwsmart, @jammer_volts and Mikael Araujo!

Special shoutout as well to @tunetheweb!

You can read it here: https://almanac.httparchive.org/en/2024/seo

#WebAlmanac #HTTPArchive #SEO

**.ECO domain** @doteco@mastodon.eco · Nov 27, 2024

Nov 27, 2024

.ECO domain @doteco@mastodon.eco

The latest edition of the #WebAlmanac by #HTTPArchive has been released: https://almanac.httparchive.org/en/2024/sustainability. The Sustainability chapter is full of great advice to reduce the carbon footprint of your website. Some observations from the report

almanac.httparchive.orgSustainability | 2024 | The Web Almanac by HTTP ArchiveSustainability chapter of the 2024 Web Almanac covering environmental impacts of web pages, where they come from and how to reduce them.

**Mike Gifford, CPWA** @mgifford@mastodon.social · Nov 11, 2024

Nov 11, 2024

Mike Gifford, CPWA @mgifford@mastodon.social

The 2024 Web Almanac has been published:

https://almanac.httparchive.org/en/2024

I have contributed to the chapters on accessibility and sustainability.

almanac.httparchive.orgThe 2024 Web AlmanacThe Web Almanac is an annual state of the web report combining the expertise of the web community with the data and trends of the HTTP Archive.

#WebAlmanac #WebAlmanac24 #HTTPArchive

**Microcks** @microcksio@mastodon.social · Oct 18, 2023

Oct 18, 2023

Microcks @microcksio@mastodon.social

Microcks 1.8.0 is out in the wild!

1st release since we joined the #cncf, and our theme is definitely #Open! Open to #community, open to new usages with #AI and #HttpArchive, open to new #shiftleft #devexp with @Testcontainers, open to you!

See https://microcks.io/blog/microcks-1.8.0-release/

Continued thread

**Leon Brocard** @orangeacme@fosstodon.org · Aug 17, 2023

Aug 17, 2023

Leon Brocard @orangeacme@fosstodon.org

And some more:

Content-Type: image / png
Content-Type: image/$JPG
Content-Type: image%2Fjpeg
Content-Type: images/gif
Content-Type: max-age=1555200
Content-Type: plain/txt
Content-Type: test/plain
Content-Type: text/htmml
Content-Type: text/javasciprt
Content-Type: text/javascriipt
Content-Type: text\html
Content-Type: type
Content-Type: TYPE/SUBTYPE
Content-Type: UTC
Content-Type: UTF-8
Content-Type: width="1280" height="720"

#httpArchive #web

**Leon Brocard** @orangeacme@fosstodon.org · Aug 17, 2023

Aug 17, 2023

Leon Brocard @orangeacme@fosstodon.org

Interesting Content-Types I have noticed in the 2023-07 mobile HTTP Archive crawl:

Content-Type: [*/*]
Content-Type: */*
Content-Type: #<Mime::NullType:0x0000000cf50828>
Content-Type: <content-typeheader>
Content-Type: <img/>
Content-Type: $MIMETYPE
Content-Type: 2
Content-Type: AddType font/woff
Content-Type: all/all
Content-Type: application/jason
Content-Type: application/jon
Content-Type: Default
Content-Type: FALSE
Content-Type: IMAGE

#httpArchive #web

**Paweł Grzybek** @pawelgrzybek@mastodon.social · May 24, 2023

May 24, 2023

Paweł Grzybek @pawelgrzybek@mastodon.social

Is there anyone who recently configured HTTP Archive dataset on GCP BigQuery? I struggle to do so based one this guide: https://github.com/HTTPArchive/httparchive.org/blob/main/docs/gettingstarted_bigquery.md https://github.com/HTTPArchive/httparchive.org/blob/main/docs/gettingstarted_bigquery.md

If anyone can help, or run 1-2 queries for me, that would be fab! Thanks a ton!

GitHubhttparchive.org/gettingstarted_bigquery.md at main · HTTPArchive/httparchive.orgThe HTTP Archive website hosted on App Engine. Contribute to HTTPArchive/httparchive.org development by creating an account on GitHub.

#httparchive

**Leon Brocard** @orangeacme@fosstodon.org · May 3, 2023

May 3, 2023

Leon Brocard @orangeacme@fosstodon.org

Recently I've been building reports based upon HTTP Archive data. Rather than call BigQuery, I instead export the data I'm interested in into Parquet format and then query it locally on my laptop using DuckDB. Here's how I did it: https://discuss.httparchive.org/t/querying-the-http-archive-with-duckdb/2568
#httpArchive #parquet #DuckDB

HTTP ArchiveQuerying the HTTP Archive with DuckDBI needed to explore ETag response headers locally and I’ve come up with a workflow for querying HTTP Archive data on my laptop in Parquet format using DuckDB. You may have to create Google Cloud project IDs, Google BigQuery workspaces, Google Cloud Storage buckets etc. Query the HTTP Archive in Google BigQuery to create a new table: SELECT resp_etag, COUNT(*) AS sum FROM `httparchive.summary_requests.2023_02_01_mobile` GROUP BY resp_etag HAVING sum > 1 ORDER BY sum DESC bq query ...

Continued thread

**Leon Brocard** @orangeacme@fosstodon.org · Apr 24, 2023

Apr 24, 2023

Leon Brocard @orangeacme@fosstodon.org

And my favourite is:
Last-Modified: Invalid Date
... which was seen on 119 responses from the #httpArchive 2023-04-01 mobile run.
#http

Continued thread

**Leon Brocard** @orangeacme@fosstodon.org · Jan 17, 2023

Jan 17, 2023

Leon Brocard @orangeacme@fosstodon.org

Is this common? I queried the HTTP Archive and it found 121,058 double weak resources in the 2022-12-01 dataset with ETags that start with W/W/. That's 0.008% of all the resources. Good news: I found no triple weak validators.
#httpArchive

Recent searches

Search options

Administered by:

Server stats:

#httparchive