FEED Validator

for Atom and RSS and KML

Congratulations!

This is a valid Atom 1.0 feed.

Recommendations

This feed is valid, but interoperability with the widest range of feed readers could be improved by implementing the following recommendations.

line 8, column 12: subtitle should not be blank [help]

  <subtitle></subtitle>
            ^

line 182, column 0: content should not contain role attribute (8 occurrences) [help]
```
This is (obviously) the nuclear option. It puts Apple outside the jurisdi ...
```

Source: http://www.mnot.net/blog/index.atom

<?xml version="1.0" encoding="utf-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
<title>mark nottingham</title>
<link rel="alternate" type="text/html" href="https://www.mnot.net/blog/" />
<link rel="self" type="application/atom+xml" href="https://www.mnot.net/blog/index.atom" />
<id>tag:www.mnot.net,2010-11-11:/blog//1</id>
<updated>2025-06-28T09:00:26Z</updated>
<subtitle></subtitle>
<entry>
<title>Using AI to Evaluate Internet Standards</title>
<link rel="alternate" type="text/html" href="https://www.mnot.net/blog/2025/06/04/using_ai" />
<id>https://www.mnot.net/blog/2025/06/04/using_ai</id>
<updated>2025-06-04T00:00:00Z</updated>
<author>
<name>Mark Nottingham</name>
<uri>https://www.mnot.net/personal/</uri>
</author>
<summary>Is AI a useful option for policymakers who want to evaluate open standards? Let's take a look.</summary>
<category term="Standards" />
<category term="Web and Internet" />
<content type="html" xml:lang="en" xml:base="https://www.mnot.net/blog/2025/06/04/using_ai">
<![CDATA[One of the reasons I’m writing the <a href="/blog/series/internet-standards/">Nature of Internet Standards</a> series is to give policymakers tools that help them to understand and evaluate open standards efforts. A big part of that is understanding whether a specification (and the effort that led to it) is really a reflection of open deliberation, consensus processes, appropriate governance, and various other hallmarks of open standards.
Evaluating these factors requires a fair amount of context and nuance. What’s the most 2025 way to avoid that work? Using AI, of course.
Some colleagues were using AI to ask similar questions, and that inspired me to wonder whether it would be a useful option for policymakers who want to evaluate the openness of Internet standards. Let’s find out.
<h2 id="the-privacy-sandbox">The Privacy Sandbox</h2>
Our first experiment is an effort that I’m fairly familiar with, having engaged with it both professionally and <a href="https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3891335">academically</a> – Google’s Privacy Sandbox. How will AI portray this longstanding and very politicised effort?
<h3 id="prompt">Prompt</h3>
I started by asking ChatGPT’s “deep research” mode:
<blockquote>
Is the Privacy Sandbox developed in an open standards process? Consider the venue(s) it’s developed in, their governance, openness, and conformance to other widely recognised signifiers of open standards processes, breadth of implementation, and adoption.
</blockquote>
It asked for a few clarifications:
<blockquote>
To give you the most accurate and up-to-date analysis, could you clarify a few things:
<ol>
<li>Are you primarily interested in the Privacy Sandbox on the web (i.e., as developed by Chrome and the W3C), or are you also considering the Android Privacy Sandbox?</li>
<li>Do you want a comparison to established open standards processes (e.g., W3C, IETF), or just an evaluation of the Privacy Sandbox initiatives on their own merits?</li>
<li>Are you looking for an overview of industry support (browsers, advertisers, publishers, etc.) and real-world implementation as part of your evaluation of adoption?</li>
</ol>
This will help me tailor the research more precisely.
</blockquote>
To which I replied:
<blockquote>
Just on the Web. A comparison to open standards processes would be good. I don’t need an overview of support and implementation, but the results should take these factors into account.
</blockquote>
<h3 id="response">Response</h3>
After churning through data (and pushing the boundaries of copyright) for a couple of minutes, ChatGPT came up with <a href="https://chatgpt.com/s/dr_68252c4763a08191b9339f52d244060e">this voluminous report</a>. However, the heart of its response is concise and fairly damning:
<blockquote>
In summary, the Privacy Sandbox has been developed in semi-open forums (W3C community groups, public repos, Google blogs) and invites outside input, but it is not a widely supported multi-vendor standard yet. Only Google’s Chrome (and related browsers) implements the APIs, and other browser vendors have developed their own approaches. The process resembles an open-source Google project more than a classic consensus-driven SDO. As Mozilla and others have noted, standards bodies like the W3C and IETF are the “ideal forum” to vet such proposals with full stakeholder oversight. At present, Privacy Sandbox’s governance does not fully align with those open standards processes – it provides considerable transparency and documentation, but its development and adoption remain largely controlled by Google.
</blockquote>
That’s not far off. W3C Community Groups aren’t standards processes; in fact, they’re little more than mailing lists where the proponents get to set almost all of the rules. ChatGPT could have made this more clear, but instead uses lots of hedging phrases like “more than” and “fully align”.
Since the full report mentioned the chair of the Business Group that also discussed the Privacy Sandbox, <a href="http://wendy.seltzer.org">Wendy Seltzer</a>, I asked what she thought. Her response:
<blockquote>
It’s fascinating to follow the sources here, because the Wikipedia article appears to have gotten its last substantive updates in 2024, and is mostly focused on older material ([which is] sometimes wrong). A human researcher would recognize that as not the best authority for recent news. A better human researcher would also ask “is this the question you want to ask, since the non-deprecation of third party cookies has changed the landscape around all these initiatives?”
That illuminates a broader issue of missing context. The answer starts from the premise that “Privacy Sandbox” is something that could be developed in an open standards process, rather than parsing the question, is PS more than the set of APIs and proposals that might fit into standards processes? How do open standards fit into the overall strategy? The GPT report can work for you, an expert in the field who can fill in those aspects, but it could be misleading to someone without the same background.
Some of the sentences quoted from other sources seem off without the context of those sources, e.g. “In January 2020 Google invited ad-tech companies to join the W3C’s Improving Web Advertising Business Group (IWABG) to discuss and give feedback on Privacy Sandbox ideas” reads as implying that Google was the only one doing the inviting.
</blockquote>
<h2 id="quic">QUIC</h2>
Next, let’s try another effort that I’m familiar with: QUIC, the effort to define a new, performance and security-focused transport protocol. I chaired the <a href="https://quicwg.org/">QUIC Working Group</a> (along with Lars Eggert) from its inception until it shipped its core deliverables.
<h3 id="prompt-1">Prompt</h3>
I asked ChatGPT’s “deep research” mode a similar question for QUIC:
<blockquote>
Is QUIC developed in an open standards process? Consider the venue(s) it’s developed in, their governance, openness, and conformance to other widely recognised signifiers of open standards processes, breadth of implementation, and adoption.
</blockquote>
and once again, it asked for a similar clarification:
<blockquote>
To best address your question about QUIC, could you please clarify what aspect you’re most interested in?
<ul>
<li>Are you focused more on the technical development and governance process (e.g., how open the standardization process is)?</li>
<li>Or are you more interested in its current implementation and adoption across the industry?</li>
<li>Should I compare it to other protocols or standards in terms of openness and adoption?</li>
</ul>
Let me know so I can tailor the research accordingly.
</blockquote>
I responded:
<blockquote>
Focus on how open the standardisation process was, but consider implementation and adoption in your answer. A comparison with other relevant standards would be helpful.
</blockquote>
<h3 id="response-1">Response</h3>
Here’s the <a href="https://chatgpt.com/share/68253995-9f5c-8009-9dae-405a071aed61">full report</a>. The summary is positively glowing:
<blockquote>
In conclusion, QUIC’s standardization has been as open and transparent as any recent IETF protocol. It was developed in the open IETF process, adhering to OpenStand and WTO/TBT ideals of consensus, transparency, and voluntary adoption. Its final RFCs affirm community consensus, and the IETF infrastructure provided public visibility into every stage. This openness is mirrored by QUIC’s broad support: independent implementers from Google, Mozilla, Apple, Cloudflare, Facebook, and beyond have all deployed QUIC. The resulting ecosystem – with multiple browsers, CDNs, servers and apps supporting QUIC – reinforces that it is a truly open standard, not controlled by any single vendor. By contrast, older protocols like HTTP/2 and TCP underwent similar open processes (IETF consensus, public RFCs) and likewise enjoy ubiquitous implementation. Together, the history and adoption of QUIC demonstrate an effective, open standardization process in line with global best practices.
</blockquote>
First of all, I love the reference to the “WTO/TBT ideals of consensus, transparency, and voluntary adoption.”
But is it accurate? Sort of. The facts here are correct: those companies did participate in the effort, implemented it, and deployed it. The process was open (as can be seen in the minutes and other materials).
I don’t know, however, how it justifies its statements about the quality of consensus – as I’ve written before, <a href="/blog/2024/05/24/consensus">consensus can go wrong in a variety of ways</a>. Others have <a href="https://journals.sagepub.com/doi/full/10.1177/14614448251336438">characterised</a> QUIC’s development has with Google firmly in the “driving seat.” Having been there for the whole standards process, I disagree with that assessment, but I’d at least expect academic work like that to be acknowledged.
<h2 id="is-ai-useful-for-assessing-open-standards">Is AI Useful for Assessing Open Standards?</h2>
I can’t count the number of times that I’ve seen policymakers, journalists, and community members refer to the Privacy Sandbox as “at the W3C” or “being standardised at the W3C.” Given that extremely low bar, ChatGPT’s summary is an improvement. Likewise, I largely agree with its assessment of QUIC, at a high level.
What’s lacking here, however, is any kind of nuance. I can’t escape the feeling that it latches onto a few narratives that appear in source materials and augments them into well-worn clichés, like we see for QUIC. The IETF has a great reputation in many sources, so that gets amplified, but there’s a lack of any critical thought.
That’s not surprising: AI can’t think. If it could, it might wonder about the criteria we’re using for “open standards” here – are those WTO/TBT ideals still relevant, and are they adequately described? Are the processes actually used in working groups lining up with the rhetoric of openness – and how would you find out if they didn’t? And, how much should all of that count if the result isn’t <a href="https://www.mnot.net/blog/2024/03/13/voluntary">proven by market adoption</a>?
In a nutshell: if you must use AI to assess the openness of a standard, only use it for the first pass, check all of the references, and then roll up your sleeves and start talking to people to get the real story.]]>
</content>
</entry>
<entry>
<title>Apple’s Best Option: Decentralize iCloud</title>
<link rel="alternate" type="text/html" href="https://www.mnot.net/blog/2025/02/09/decentralize-icloud" />
<id>https://www.mnot.net/blog/2025/02/09/decentralize-icloud</id>
<updated>2025-02-09T00:00:00Z</updated>
<author>
<name>Mark Nottingham</name>
<uri>https://www.mnot.net/personal/</uri>
</author>
<summary>What can Apple do in the face of a UK order to weaken encryption worldwide? Decentralize iCloud, to start.</summary>
<category term="Tech Regulation" />
<category term="Web and Internet" />
<content type="html" xml:lang="en" xml:base="https://www.mnot.net/blog/2025/02/09/decentralize-icloud">
<![CDATA[As has been <a href="https://www.washingtonpost.com/technology/2025/02/07/apple-encryption-backdoor-uk/">widely reported</a>, the government of the United Kingdom has secretly ordered Apple to build a back door into iCloud to allow ‘blanket capability to view fully encrypted material.’
Assuming the UK doesn’t back down, what are Apple’s options? This is my personal take: if I’ve missed something, I’d love to hear about it.
<h3 id="option-1-comply">Option 1: Comply</h3>
Most companies would just comply with the order, but Apple is not most companies.
That’s not just because they have <a href="https://www.apple.com/privacy/">marketed themselves as privacy and security conscious</a>, although that presumably factors into their decision. From what I’ve seen from interacting with their engineers and observing how they behave (both in technical standards bodies and in their products), this is a commitment that goes much deeper than just marketing.
More significantly, Apple will be considering the secondary and tertiary consequences of compliance. So far, every democratic country around the world has refrained from making such an order; for example, Australia’s <a href="https://www.mnot.net/blog/2018/11/15/trust_australia">widely debated</a> legislation that mirrors the UK “Snooper’s Charter” has an explicit provision to disallow “systemic weakening” of encryption like we see here.
If the UK successfully forces Apple’s hand, every other government in the world is likely to take notice and consider making similar (or even more extreme) demands. <a href="https://www.wired.com/story/apple-photo-scanning-csam-communication-safety-messages/">CSAM scanning</a> will just be the start: once access to that much data is available, it’s open season for everything from <a href="https://en.wikipedia.org/wiki/Lèse-majesté_in_Thailand">Lèse-majesté</a> to punishing activists and protesters to policing sexual orientation, abortion, and other socially motivated laws. Even if a particular country doesn’t make the same demand of Apple, arrangements like Five Eyes will allow one agency to peer over another’s shoulders.
As I’ve written before, <a href="https://www.mnot.net/blog/2024/04/29/power">no one should have that much power</a>.
In the tinderbox that politics has become in many parts of the world, this is gasoline. I’d pay good money to be a fly on the wall in the meetings taking place with the <a href="https://www.gov.uk/government/organisations/foreign-commonwealth-development-office">Foreign Service</a>, because they of all people should understand the potential global impact of a move like this. Of course, in a world where USAID is shut down by Elon Musk and some teenagers, nothing is off the table – and that’s why we should be so concerned about this outcome.
<h3 id="option-2-leave">Option 2: Leave</h3>
Apple’s second option is to leave the UK. Full stop.
Close the Apple stores, online and retail. Stop providing iCloud, stop selling iPhones and all the other various i-gear. Close the <a href="https://www.businessinsider.com/apple-uk-headquarters-14-billion-redevelopment-battersea-power-station-2016-10">beautiful new UK HQ at Battersea</a>, and lay off (or transfer overseas) around 8,000 employees (reportedly).
This is (obviously) the nuclear option. It puts Apple outside the jurisdiction of the UK,<a href="#fn:1" class="footnote" rel="footnote" role="doc-noteref">1</a> and at the same time orphans every UK Apple user – their phones and computers don’t quite become bricks, but they will definitely have limited utility and lifetime.
Given that along with Apple’s claim to <a href="https://www.standard.co.uk/business/apple-s-spectacular-offices-at-battersea-power-station-b1110130.html">support 550,000 UK jobs</a>, it’s likely to be effective – these consequences would make the government extremely unpopular overnight.
However, this option is also massively expensive: reportedly, total Apple revenue in the UK is <a href="https://www.retailgazette.co.uk/blog/2023/07/apple-uk-sales/">something like £1.5bn</a>. Add on top the one-time shutting down costs, and even Apple’s balance sheet will notice.
Perhaps more importantly, this is also a strategically worrisome direction to go in, because it plays into the narrative that Big Tech is more powerful than sovereign nations. Other countries will take notice, and may coordinate to overcome Apple’s reticence. Apple will now have to choose the markets that it operates in based on how it feels about those country’s commitments to human rights on an ongoing basis – hardly a situation that any CEO wants to be in.
Finally, this option simply won’t work if one of those countries is the United States, Apple’s home. I’ll leave it to you, dear reader, to decide how much you trust your predictions of its actions.
<h3 id="option-3-open-up">Option 3: Open Up</h3>
Apple’s third option is to remove itself as a target in a more subtle way than option two.
The UK is presumably interested in Apple providing this functionality because iCloud’s design conveniently makes a massive amount of data convenient to access in one location: Apple’s servers. If that data is instead spread across servers operated by many different parties, it becomes less available.
In effect, this is the decentralize iCloud option. Apple would open up its implementation of iCloud so that third-party and self-hosted providers could be used for the same functions. They would need to create interfaces to allow switching, publish some specifications and maybe some test suites, and make sure that there weren’t any intellectual property impediments to implementation.
There could be some impact on Apple revenue here, but I suspect it’s not huge; many people would continue to buy iCloud for convenience, and for non-storage features that Apple bundles in <a href="https://www.apple.com/icloud/">iCloud+</a>.
Think of it this way: Apple provides e-mail service with iCloud, but doesn’t require you to use it: you can use your own or a third party provider without any drama, because they use common protocols and formats. Why should file sync be any different? Why can’t Apple make using a third-party service as seamless and functional as iCloud?
This isn’t a perfect option. Orders could still force weakened encryption, but now they’d have to target many different parties (depending on the details of implementation and deployment), and they’d have to get access to the stored data. If you choose a provider in another jurisdiction, that makes doing so more difficult, depending on what legal arrangements are in place between those jurisdictions; if you self-host, they’ll need to get physical access to your disks.
<h3 id="what-will-and-should-apple-do">What Will (and Should) Apple Do?</h3>
Computer operating systems are fundamental to security: once we lose trust in them, it’s pretty much game over. The UK has chosen a risky and brash path forward, and Apple will need to think carefully about how to navigate it.
It should be no surprise that I favour option three. While Apple is notoriously a closed company, it’s not completely averse to collaborating and working in the open when doing so is in its interests – and, given its other options, that’s arguably the case here.
Conceivably, Apple might even be forced into taking the “decentralize iCloud” option if regulators like those implementing the Digital Markets Act in the EU decide that doing so is necessary for competition. Apple has been <a href="https://ec.europa.eu/competition/digital_markets_act/cases/202344/DMA_100025_228.pdf">designated as a gatekeeper</a> for the ‘core platform service’ provided by iOS, and while that designation currently doesn’t include file synchronisation services, that might change.
Of course, the UK government may back down. However, the barrier to some other government taking similar steps is now smaller, and Apple would do well to consider its longer term options even if action turns out to be unnecessary right now.
Thanks to <a href="https://eupolicy.social/@1br0wn">Ian Brown</a> for his input to this article.
<div class="footnotes" role="doc-endnotes">
<ol>
<li id="fn:1">
Presumably. Both inter-jurisdictional coordination and extraterritorial application of the law may complicate that. IANAL. <a href="#fnref:1" class="reversefootnote" role="doc-backlink">↩</a>
</li>
</ol>
</div>]]>
</content>
</entry>
<entry>
<title>Platform Advantages: Not Just Network Effects</title>
<link rel="alternate" type="text/html" href="https://www.mnot.net/blog/2024/11/29/platforms" />
<id>https://www.mnot.net/blog/2024/11/29/platforms</id>
<updated>2024-11-29T00:00:00Z</updated>
<author>
<name>Mark Nottingham</name>
<uri>https://www.mnot.net/personal/</uri>
</author>
<summary>A new book explores an intriguing idea: that there are core processes in some platforms that naturally tilt the table towards being implemented in a single company.</summary>
<category term="Tech Regulation" />
<category term="Web and Internet" />
<content type="html" xml:lang="en" xml:base="https://www.mnot.net/blog/2024/11/29/platforms">
<![CDATA[Over the past few years, there’s been growing legal and academic interest in platforms — their functioning, potential harms, and advantages over competitors.
On that last question, most of the literature that I’ve seen has focused on factors like network effects and access to data. However, a forthcoming book by <a href="https://www.hbs.edu/faculty/Pages/profile.aspx?facId=6417">Carliss Baldwin</a> proposes some significant additional – and structural – advantages that accrue to those who control them. <a href="https://mitpressbookstore.mit.edu/book/9780262049337">Design Rules Volume 2: How Technology Shapes Organizations</a><a href="#fn:1" class="footnote" rel="footnote" role="doc-noteref">1</a>
builds on Volume One (which I <a href="https://www.mnot.net/blog/2024/05/10/design-rules-vol-one">wrote about earlier</a>) with a goal to ‘build and defend a general theory explaining how technologies affect the structure and evolution of organizations that implement the technologies.’
Baldwin argues that “whether a technology will generate the most value through single, unified corporations, through platform-based business ecosystems, or through open source projects depends on the balance of complementarity within the technical system.” Let’s unpack that (in my words, with apologies for any misinterpretation of her work).
<h3 id="modularity">Modularity</h3>
Imagine a technical system, such as a service provided across the Internet, that comprises numerous components. This is a common occurrence because, as mentioned in Volume One, we manage complexity through modularity. We break down tasks into smaller units that can be distributed among many individuals, preventing any single person from having to comprehend the entire system’s intricacies.
These components can have various degrees of coupling – i.e., interdependency. While we always strive for loose coupling, which allows for easy modification or replacement of components without affecting others, it’s not always feasible to avoid tight coupling when there are close dependencies.
<h3 id="coupling-and-governance">Coupling and Governance</h3>
Baldwin points out that systems with many tightly coupled functions are better situated in a single company due to the ease of managing these relationships within the hierarchical and closely related environment of a modern corporation. Conversely, she suggests that those with very loosely coupled functions are more appropriate for implementation across multiple entities because this arrangement enables the generation of greater overall value.
In the middle lies a “Goldilocks zone,” where some amount of coordination is necessary, but there’s still a benefit to distributing functions amongst many actors. These conditions allow formation of a business ecosystem – a set of “independent organizations and individuals engaged in complimentary activities and investments.” As Baldwin points out:
<blockquote>
Ecosystems rely on distributed governance, meaning that each member has the right to make certain decisions according to his or her own interests and perceptions. In place of direct authority, coordination of an ecosystem requires negotiation among members with different priorities and interests.
</blockquote>
<h3 id="platforms">Platforms</h3>
There are many examples of such distributed governance schemes, including Open Source and Open Standards. However, it’s hard to ignore the dominance of platforms in the current landscape, which she defines as ‘a technological means of coordinating design, production, and exchange within modular architectures.’ Platforms aren’t so much distributed governance schemes as they are centralised control points (or even <a href="https://www.rfc-editor.org/rfc/rfc9518.html">choke points</a>).
She then goes on to break down a typology of platforms, with particular focus on transaction platforms, like eBay, Amazon, and Chrono24 – and communication platforms, such as Facebook, Bluesky, and X.
Here’s where things get really interesting. Baldwin argues that certain core processes which are essential to implement these types of platforms are bound to be tightly coupled, thereby heavily tilting the table towards implementation by a single company:
<blockquote>
The need for tight integration of core processes is the first reason for-profit corporations subject to unified governance have replaced organizations subject to distributed governance in almost all digital exchange platforms. Traditional exchange processes did not require the same high degree of synchronization as algorithmic processes.
</blockquote>
In transaction platforms, she identifies search and ad placement, dynamic pricing, and data analysis and prediction as processes that must occur within milliseconds to provide a satisfactory user experience. For communication platforms, the relevant core services are search and ad placement, ad selection, dynamic pricing of an ad, and (again) data analysis and prediction.
<h3 id="what-this-means-for-the-internet">What this means for the Internet</h3>
Yes, search is more difficult on a federated platform like Mastodon, but it’s possible if you relax the need for immediate updates – as it can be if you <a href="https://berjon.com/fixing-search/#how-does-search-work-">rework the relationships in that arena</a>. When you get past that, it’s also hard not to notice that these core processes are mostly advertising-related.
And that’s crucial. These companies have stepped in to solve coordination problems (“How do we communicate around the globe? How do we do transactions with people we haven’t met?”) by creating platforms that fully exploit their centralization. They are supported by real-time advertising systems because the table is tilted towards that outcome, and building a real-time advertising-supported ecosystem with distributed governance is hard.<a href="#fn:2" class="footnote" rel="footnote" role="doc-noteref">2</a>
Much of that friction goes away if you relax the constraint of being advertising-supported, or even remove the real-time requirement from advertising (e.g., by using contextual advertising). However, you still have a coordination problem, and because real-time advertising is the most lucrative way to monetise a centralized position, decentralizing these systems means big companies won’t be nearly as interested in these outcomes.
The history of the Internet is illustrative here. We had RSS and Atom feeds, but there wasn’t a business model in that: however, there was in ‘news feeds’ on Facebook. We had open messaging protocols like XMPP, but they were supplanted by proprietary chat platforms that wanted to lock their users in and monetise them. Meanwhile, e-mail is being slowly swallowed by GMail and a few others as we helplessly watch.
In short: there are less-recognised structural forces that push key Internet services into centralized, real-time advertising-supported platforms. Along with factors like network effects and access to data, they explain some of why the Internet landscape looks like it does.
Decentralized alternatives must overcome those forces where they can’t be avoided. They also need to be developed and supported, and to compete with those centralised platforms, they will need to be well-funded. To go back to the RSS/Atom example, there is a <a href="https://www.mnot.net/blog/2024/08/25/feeds">lot of work that could improve that ecosystem</a>, but no one has a strong incentive to do so.
In these conditions, ‘build it and they will come’ is insufficient; simply creating Internet standards and Open Source software won’t solve the coordination challenges. Most current Internet companies lack the incentive to fund such efforts since they’re unlikely to accommodate real-time advertising.
Who might? My thoughts turn to the various discussions surrounding <a href="https://www.undp.org/digital/digital-public-infrastructure">Digital Public Infrastructure</a>. Exploring how to make that viable is a crucial (and important) topic that I’ll leave for another day.
This is just one aspect of Design Rules Volume 2; there’s much more to discover in this excellent book. I’ve been enthusiastically recommending it to anyone who takes the time to listen.
Thanks to <a href="https://berjon.com/">Robin Berjon</a> for reviewing this article.
<div class="footnotes" role="doc-endnotes">
<ol>
<li id="fn:1">
To be published on 24 December. Many thanks to Professor Baldwin for an early copy. <a href="#fnref:1" class="reversefootnote" role="doc-backlink">↩</a>
</li>
<li id="fn:2">
Again, not necessarily impossible; for example, look at what Mozilla et al are doing in the <a href="https://patcg.github.io">Private Advertising Technology</a> effort. <a href="#fnref:2" class="reversefootnote" role="doc-backlink">↩</a>
</li>
</ol>
</div>]]>
</content>
</entry>
<entry>
<title>On Opting Out of Copyright</title>
<link rel="alternate" type="text/html" href="https://www.mnot.net/blog/2024/09/18/opt-out" />
<id>https://www.mnot.net/blog/2024/09/18/opt-out</id>
<updated>2024-09-18T00:00:00Z</updated>
<author>
<name>Mark Nottingham</name>
<uri>https://www.mnot.net/personal/</uri>
</author>
<summary>The EU AI Act and emerging practice flip copyright’s default opt-in regime to an opt-out one. What effects is this likely to have on the balance of power between rights holders and reuse?</summary>
<category term="Tech Regulation" />
<category term="Web and Internet" />
<content type="html" xml:lang="en" xml:base="https://www.mnot.net/blog/2024/09/18/opt-out">
<![CDATA[The EU AI Act and emerging practice flip copyright’s default opt-in regime to an opt-out one. What effects is this likely to have on the balance of power between rights holders and reuse?
Copyright is a default opt-in regime, from the standpoint of the rights holder. If I publish something on this blog, the presumption is that I retain rights unless I specifically license them – for example, by attaching a <a href="http://creativecommons.org">creative commons</a> license. If I don’t do that, you can’t legally reuse my content (unless your use falls within certain exemptions).
You can think about this arrangement in terms of protocol design: it’s an agreement between parties whose nature creates certain incentives and barriers to behaviour. Someone who wants to reuse my content has the burden of getting a license from me, and proving that they have one if I challenge them. I have the burden of finding misuse of my content and pursuing it.
Technical systems can assist both parties in these tasks. I can use search engines of various sorts to find potential abuses; a licensee can prove that a particular license was available by showing its existence in the cache of a disinterested third party (often, one of the same search engines).
This creates an equilibrium: the burdens are balanced to favour certain behaviours. You might argue that the balance is unjust (and many <a href="https://pluralistic.net">do</a>), but it is known and stable.
As <a href="https://www.mnot.net/blog/2024/04/21/ai-control">discussed previously</a>, the EU AI Act and emerging practice flip copyright’s default opt-in regime to an opt-out one. A rights holder now has to take positive action if they want to reserve their rights. While on the face of it they still have the same capability, this ends up being a significant practical shift in power.
That’s partly because of the nature of opt-out itself. The burden shifts: now, the rights holder must find misuse of their content, and prove that they opted out.
Proving that you consistently opted out at every opportunity is difficult, because it’s effectively proving a negative – that you never failed to opt out. Search engines don’t see every request made on the Internet; they just crawl it periodically, sampling what they see. An AI crawler can plausibly claim that the opt out wasn’t present when they crawled, and the rights holder is reduced to proving that the teapot isn’t in orbit.
Notably, this is the case whether the opt-out is attached to the content by a mechanism like robots.txt or if it’s embedded in the content itself as metadata. In the former case, content without the opt-out might be obtained at a different location, or at a different time; in the latter, the opt-out might be stripped from the content or a copy of it, either intentionally or unintentionally (e.g. it is a common to strip metadata from images to optimise performance and improve privacy).
On top of that, using this regime for AI makes finding misuse difficult too. There’s no easy way to query an LLM for a particular bit of content in the corpus that was used to train it; instead, you have to trust the vendor to tell you what they used. While transparency measures are being discussed as a policy solution to this issue, they don’t have the same properties as third-party or technical verification, in that they require trusting assertions from the vendor.
In this manner, changing copyright’s default opt-in to an opt-out for AI dramatically shifts the burden of compliance to rights holders, and the lack of support for managing those burdens brings into question the practical enforceability of the regime. It could be argued that this is appropriate for policy reasons – in particular, to enable innovation. However, it is a mistake to say it doesn’t represent a change in the balance of power as compared to opt-in.]]>
</content>
</entry>
<entry>
<title>What RSS Needs</title>
<link rel="alternate" type="text/html" href="https://www.mnot.net/blog/2024/08/25/feeds" />
<id>https://www.mnot.net/blog/2024/08/25/feeds</id>
<updated>2024-08-25T00:00:00Z</updated>
<author>
<name>Mark Nottingham</name>
<uri>https://www.mnot.net/personal/</uri>
</author>
<summary>Web feeds could be so much more if we put some effort into them. This post explores how we could start.</summary>
<category term="Internet and Web" />
<category term="Web Feeds" />
<content type="html" xml:lang="en" xml:base="https://www.mnot.net/blog/2024/08/25/feeds">
<![CDATA[More than twenty years ago, Web feeds were all the rage. Not proprietary news feeds on Facebook or ‘X’ – openly defined, direct producer-to-user feeds of information that you had total control over. Without ads. ‘Syndication’ meant that publishers could reach wider audiences without intermediaries; ‘aggregation’ meant that you could get updates from everyone you were interested in without having to hop all over the Web.
I’m talking about RSS and Atom, of course. I have fond memories of the community that launched this, having started the Syndication Yahoo! Group and later going on to co-edit the <a href="https://www.rfc-editor.org/rfc/rfc4287.html">Atom specification</a>. Since that period of busy activity, however, the underlying technology hasn’t seen much care or attention. There are some bright spots – podcasts have effectively profiled RSS to <a href="https://www.anildash.com//2024/02/06/wherever-you-get-podcasts/">create a distributed ecosystem</a>, and ActivityPub has taken the mantle of social feeds – but the core ‘I want to track updates from the Web in a feed reader’ use case has languished.
Despite that lack of attention, the feed ecosystem is flourishing; there are many feeds out there, helped by things like automatic feed generation in platforms such as Wordpress. Right now, I’m subscribed to more than a hundred feeds, tracking everything from personal blogs to academic publications to developments in competition law to senate inquiries, and I check them multiple times a day almost every day.
It’s just that feeds could be so much more with some love and directed care – something that could jump from a niche use case to a widespread ‘normal’ part of the Web for many.
It’s also a good time to revitalise feeds. When Google killed Reader years ago, no one questioned their right to do so, even if we grumbled about it. Now, however, regulators are much more aware of the power that platforms have to tilt markets to their benefit, and many are calling for more decentralised approaches to functions that are currently only provided by concentrated intermediaries. People are also more wary of giving away their e-mail addresses for newsletters (the old-tech solution to feeds) when e-mail addresses are rapidly becoming the replacement for tracking by third-party cookies.
With that in mind, here are some of the areas where I think RSS needs some help.
<h3 id="community">Community</h3>
Communication between implementers of a technology is important; it facilitates coordination for addressing bugs and interoperability problems, and smooths the introduction of new features.
Unfortunately, the feed ecosystem has little such coordination. There are few opportunities for developers of software that consumes or produces feeds to talk. This lack of coordination is compounded by how diverse the ecosystem is: there are many implementations on both sides, so it’s hard to improve things for any one actor.
This situation reminds me of the state of the HTTP protocol in the early 2000’s. Back then, that protocol’s implementers were exhausted, because the Web was still scaling up rapidly, and their software needed to mature to match it. HTTP/1.1 had shipped in the late 90’s, and no one was willing to discuss what comes next: they were too busy. Even if they did want to talk about the protocol, there was no natural ‘home’ for it – and that lack of community resulted in numerous interoperability issues and one-off workarounds (anyone remember the <code class="language-plaintext highlighter-rouge">X-Pad</code> header field?).
What we did to improve HTTP suggests some possible paths forward for feeds. In 2007, we established the <a href="https://httpwg.org/">HTTP Working Group</a> in the IETF to act as a home for the protocol. At first it was just a few people who had the time and interest, but over time we had more and more of the implementer community paying attention, eventually taking on HTTP/2.
Not everyone has the time or willpower to participate in standards, however. So, several years ago we started holding the <a href="https://httpworkshop.org/">HTTP Workshop</a>, an informal community get-together for implementers and practitioners, where we can discuss our experiences, problems, and proposals to keep the protocol healthy and active.
Both of these approaches could be used to revitalise the feed implementer community over time, if we can get a core of people interested.
<h3 id="user-agency">User Agency</h3>
Feed readers are an example of user agents: they act on behalf of you when they interact with publishers, representing your interests and preserving your privacy and security. The most well-known user agents these days are Web browsers, but in many ways feed readers do it better – they don’t give nearly as much control to sites about presentation and they don’t allow privacy-invasive technologies like cookies or JavaScript.
However, this excellent user agency isn’t well-defined, and we don’t even know if it’s consistent from reader to reader. We need a common understanding of what a feed reader is and what it isn’t, so that users can evaluate whether their reader is a ‘good’ one, and so we can make principled decisions about what a feed reader does and doesn’t do when we extend them.
I started to write about this a while back in <a href="https://datatracker.ietf.org/doc/draft-nottingham-feed-privacy/">Privacy Considerations for Web Feed Readers</a>, but it fizzled out due to the lack of an active community (see above).
<h3 id="interoperability-tests">Interoperability Tests</h3>
Feed readers need to behave in predictable, compatible ways; otherwise publishers won’t know how their content will be presented to users, and won’t trust them to do it. Most readers have settled on a latent profile of the Web stack when they show feed content, but it’s uneven, and that variability limits the use cases for Web feeds.
For example, many YouTube content creators are looking for alternatives because they don’t want to be at the mercy of Google’s algorithm; some are setting up their own Web sites to host video, but are finding that it’s difficult to hold their users’ attention in a sea of choices. Feeds could help – if video interoperates cleanly in feed readers. Does it? We have no idea.
Another example: some feeds that I view in the excellent <a href="https://reederapp.com">Reeder</a> show the ‘hero’ image twice: once because it shows up in the entry’s metadata, and once in the content. I suspect that’s the case because the reader that the publisher used didn’t show the metadata-sourced image. Interop tests would have picked this up and helped to create pressure for one way to do it.
Let’s not even get started on feed autodiscovery.
Creating interop tests requires both resources and buy-in from the developer community, but if we want Web feeds to be a platform that publishers create content for, it’s necessary: the Web has set the bar high.
<h3 id="best-practices-for-feeds">Best Practices for Feeds</h3>
Publishers need stronger and more current guidance for how to publisher their feed content. Some of that is the basics: for example, ‘test your feeds, including for autodiscovery’ – although interop issues (per above) makes that a difficult task.
We should also go further and share good patterns. For example, as someone who uses a feed reader, I’m annoyed by all of the ‘subscribe’ banners I see when I click through to the site – it should know that I came from a feed reader. How? If the feed’s links contain a query string that indicates the source, the Web page should be able to hide ‘subscribe’ banners and use cookies to remember that I’m a feed subscriber.
(I can do something about this one more easily than the others by updating the <a href="https://www.mnot.net/rss/tutorial/">RSS and Atom Feed Tutorial</a>. I’ll put that on my TODO list…)
<h3 id="browser-integration">Browser Integration</h3>
Web browsers used to know about feeds: you’d know whether a page had a feed, and could subscribe (either locally, or have it dispatched out to a separate reader). The user experience wasn’t great, but it at least made feeds visible on the Web.
Now, the browser vendors have ripped feed support out, seeming to have the attitude that feeds can be accommodated with extensions. That’s true to a point: Reeder has a Safari extension, for example, and it lets you know when there’s a feed on the page and subscribe to it.
However, using an extension has privacy implications: I need to trust my feed reader to see the content of every Web page I go to serve this function. That’s not great. Also, most users can’t be bothered to walk through the steps of adding extensions: it’s not ‘normal’ on the Web if you have to modify your browser to do it.
Feed support should be built into browsers, and the user experience should be excellent. It should be possible to dispatch to a cloud reader. It should be possible to have customised subscription flows. It should work out of the box so people don’t have to struggle with installing privacy-invasive extensions.
However, convincing the browser vendors that this is in their interest is going to be challenging – especially when some of them have vested interests in keeping users on the non-feed Web.
<h3 id="authenticated-feeds">Authenticated Feeds</h3>
Some publishers want to gate their feeds behind a subscription – for example, <a href="https://arstechnica.com/rss-feeds/">ars technica</a> has both free and subscriber-only feeds. Right now, that’s only possible through clunky mechanisms like <a href="https://w3ctag.github.io/capability-urls/">capability URLs</a>, which are less than ideal for shared clients like Web feed readers, because they can ‘leak’ the subscription information, and don’t benefit from caching.
We might not be able to do better than that, but it’s worth considering: would publishers trust third parties like cloud feed readers enough to delegate subscription authentication to them? What would that look like?
<h3 id="publisher-engagement">Publisher Engagement</h3>
Finally, one of the downsides of feeds from a publisher standpoint is that you get very little information about how your feed is used. That’s a huge benefit from a privacy perspective, but it also hinders adoption of Web feeds, forcing people into subscribing by e-mail (which has its own privacy issues).
I recognise that some are fine with this: personally, most of the feed content I consume isn’t commercial, and it’s great. However, if we can make feeds more ‘normal’ by providing some limited feedback to publishers in a privacy preserving way, that could be very good for the ecosystem.
For example, if the Web browser were able to indicate to the Web site what proportion of a site’s audience uses feed readers, publishers can get an idea of how large their potential feed audience is. Keeping in mind that feed readers are likely much ‘stickier’ than other delivery mechanisms, this could be quite attractive.
Or, if feed readers were able to give publishers an indication of what articles were viewed, that would give them the information they need to optimise their content.
On the Web, these kinds of tasks are currently performed with privacy-invasive technologies, often using third-party cookies. Feed readers could take advantage of newer privacy tech like <a href="https://datatracker.ietf.org/wg/ppm/about/">Privacy Preserving Measurement</a> and <a href="https://www.rfc-editor.org/rfc/rfc9458.html">Oblivious HTTP</a> to provide these functions in much smarter and targeted ways.
However, doing so would require coordination between implementers (see: Community) and a deep respect for the user in how they’re designed (see: User Agency).
<h3 id="what-else">What Else?</h3>
I’m sure there’s more. I wrote this primarily to gauge interest: who’s up for taking Web feeds to the next level?
If this is interesting to you, I’d love to hear about it. I asked for an <a href="https://mailman3.ietf.org/mailman3/lists/feeds.ietf.org/">IETF mailing list about feeds</a> to be set up a while back, and while it hasn’t been used yet, that’s probably the best place to start – please subscribe and post there!]]>
</content>
</entry>
</feed>

If you would like to create a banner that links to this page (i.e. this validation result), do the following:

Download the "valid Atom 1.0" banner.
Upload the image to your own server. (This step is important. Please do not link directly to the image on this server.)
Add this HTML to your page (change the image src attribute if necessary):

<a href="http://www.feedvalidator.org/check.cgi?url=http%3A//www.mnot.net/blog/index.atom"><img src="valid-atom.png" alt="[Valid Atom 1.0]" title="Validate my Atom 1.0 feed" /></a>

If you would like to create a text link instead, here is the URL you can use:

http://www.feedvalidator.org/check.cgi?url=http%3A//www.mnot.net/blog/index.atom

Home · About · News · Docs · Terms