What are the longest HTTP header names and values? I dug into the HTTP Archive to find out: https://www.fastly.com/blog/the-lengthiest-http-headers #http #webperf #httpArchive
What are the longest HTTP header names and values? I dug into the HTTP Archive to find out: https://www.fastly.com/blog/the-lengthiest-http-headers #http #webperf #httpArchive
Congrats to our founder @MichaelLewittes, as well as @dwsmart, @jammer_volts, Mikael Araujo, and @tunetheweb for their work on the SEO chapter of the HTTP Archive's Web Almanac!
You can read it here: https://almanac.httparchive.org/en/2024/seo
Was an absolute pleasure writing/editing the SEO chapter of the HTTP Archive's Web Almanac with
@dwsmart, @jammer_volts and Mikael Araujo!
Special shoutout as well to @tunetheweb!
You can read it here: https://almanac.httparchive.org/en/2024/seo
The latest edition of the #WebAlmanac by #HTTPArchive has been released: https://almanac.httparchive.org/en/2024/sustainability. The Sustainability chapter is full of great advice to reduce the carbon footprint of your website. Some observations from the report
The 2024 Web Almanac has been published:
https://almanac.httparchive.org/en/2024
I have contributed to the chapters on accessibility and sustainability.
Microcks 1.8.0 is out in the wild!
1st release since we joined the #cncf, and our theme is definitely #Open! Open to #community, open to new usages with #AI and #HttpArchive, open to new #shiftleft #devexp with @Testcontainers, open to you!
And some more:
Content-Type: image / png
Content-Type: image/$JPG
Content-Type: image%2Fjpeg
Content-Type: images/gif
Content-Type: max-age=1555200
Content-Type: plain/txt
Content-Type: test/plain
Content-Type: text/htmml
Content-Type: text/javasciprt
Content-Type: text/javascriipt
Content-Type: text\html
Content-Type: type
Content-Type: TYPE/SUBTYPE
Content-Type: UTC
Content-Type: UTF-8
Content-Type: width="1280" height="720"
Interesting Content-Types I have noticed in the 2023-07 mobile HTTP Archive crawl:
Content-Type: [*/*]
Content-Type: */*
Content-Type: #<Mime::NullType:0x0000000cf50828>
Content-Type: <content-typeheader>
Content-Type: <img/>
Content-Type: $MIMETYPE
Content-Type: 2
Content-Type: AddType font/woff
Content-Type: all/all
Content-Type: application/jason
Content-Type: application/jon
Content-Type: Default
Content-Type: FALSE
Content-Type: IMAGE
Is there anyone who recently configured HTTP Archive dataset on GCP BigQuery? I struggle to do so based one this guide: https://github.com/HTTPArchive/httparchive.org/blob/main/docs/gettingstarted_bigquery.md https://github.com/HTTPArchive/httparchive.org/blob/main/docs/gettingstarted_bigquery.md
If anyone can help, or run 1-2 queries for me, that would be fab! Thanks a ton!
Recently I've been building reports based upon HTTP Archive data. Rather than call BigQuery, I instead export the data I'm interested in into Parquet format and then query it locally on my laptop using DuckDB. Here's how I did it: https://discuss.httparchive.org/t/querying-the-http-archive-with-duckdb/2568
#httpArchive #parquet #DuckDB
And my favourite is:
Last-Modified: Invalid Date
... which was seen on 119 responses from the #httpArchive 2023-04-01 mobile run.
#http
Is this common? I queried the HTTP Archive and it found 121,058 double weak resources in the 2022-12-01 dataset with ETags that start with W/W/. That's 0.008% of all the resources. Good news: I found no triple weak validators.
#httpArchive