LLMs are bad at vibing specifications

Lobsters

BUbuttondown.com by hwayne

Mar 11, 2026 at 02:53 AM6.0/10

The article examines how LLMs perform poorly at writing formal specifications in languages like TLA+ and Alloy, despite initial optimism about AI as a 'specification force multiplier'. Using a real-world example, the author shows that AI-generated specs often contain fundamental errors like incorrect Boolean usage and missing imports. This highlights the gap between AI assistance and genuine expertise in formal methods.

Background

Formal methods like TLA+ and Alloy are mathematical approaches to software specification and verification. While powerful, they require significant expertise, leading to interest in using AI to lower the barrier to entry.

Source: Lobsters
Published: Mar 11, 2026 at 02:53 AM
Score: 6.0 / 10

Read Original →