<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/"><channel><title>Visual Reasoning on Tom Burkert</title><link>https://blog.burkert.me/tags/visual-reasoning/</link><description>Recent content in Visual Reasoning on Tom Burkert</description><image><title>Tom Burkert</title><url>https://blog.burkert.me/assets/</url><link>https://blog.burkert.me/assets/</link></image><generator>Hugo -- 0.148.0</generator><language>en-us</language><lastBuildDate>Mon, 08 Dec 2025 21:35:09 +0100</lastBuildDate><atom:link href="https://blog.burkert.me/tags/visual-reasoning/index.xml" rel="self" type="application/rss+xml"/><item><title>How many AI PhDs does it take to change a hinge?</title><link>https://blog.burkert.me/posts/ai_phds_changing_a_hinge/</link><pubDate>Mon, 08 Dec 2025 21:35:09 +0100</pubDate><guid>https://blog.burkert.me/posts/ai_phds_changing_a_hinge/</guid><description>&lt;p>It started with a broken hinge in our kitchen cabinet. I am no handyman, but I like to do as much as I can around the house and I dare say I&amp;rsquo;ve gotten pretty okay at it. The one thing that can set me back, though, is not knowing the correct terminology. What type of hinge do I need? Concealed? Inset? Overlay? Clip-On? Soft-close? Sprung? An avalanche of terms that I only barely understand paralyzes me. Luckily, there&amp;rsquo;s a new helper in town: AI!&lt;/p></description><content:encoded><![CDATA[<p>It started with a broken hinge in our kitchen cabinet. I am no handyman, but I like to do as much as I can around the house and I dare say I&rsquo;ve gotten pretty okay at it. The one thing that can set me back, though, is not knowing the correct terminology. What type of hinge do I need? Concealed? Inset? Overlay? Clip-On? Soft-close? Sprung? An avalanche of terms that I only barely understand paralyzes me. Luckily, there&rsquo;s a new helper in town: AI!</p>
<p>I&rsquo;ve used vision-enabled LLMs before to identify types of tools, bike parts, or PC components with mixed results, but the net result was usually positive. At the least, I became armed with some of the terminology and knew what to look up on Google. The recent releases of Gemini 3 Pro (&quot;<a href="https://blog.google/technology/developers/gemini-3-pro-vision/" target="_blank" rel="noopener">The frontier of vision AI</a>&quot;), GPT-5 (&quot;<a href="https://openai.com/index/introducing-gpt-5/" target="_blank" rel="noopener">can reason more accurately over images and other non-text inputs</a>&quot;) and Claude Opus 4.5 (&quot;<a href="https://www.anthropic.com/news/claude-opus-4-5" target="_blank" rel="noopener">better vision, reasoning, [&hellip;] and it is state-of-the-art in many domains</a>&quot;) have made me curious to test the much hyped visual reasoning capabilities. Plus, they are touted to have PhD- or expert-level intelligence. Can an AI PhD help me change a hinge?</p>
<h2 id="mission-impossible">Mission (im)possible</h2>
<p>I set out with a simple request: &ldquo;Identify this type of hinge and find out where it can be bought in Czechia,&rdquo; accompanied by the following picture:<br>
<img alt="Hinge" loading="lazy" src="/images/hinge.jpg#center"></p>
<p>Now, a small aside about hinges — the little I know. I promise it won&rsquo;t be long, it can be explained in 30 seconds.</p>
<p>This is a hinge with a cup:<br>
<img alt="Hinge with a cup" loading="lazy" src="/images/hinge-cup.jpg#center"></p>
<p>And this, my friends, is a type of hinge that does not use a cup, but is mounted directly to the face frame of the cabinet:<br>
<img alt="Hinge without a cup" loading="lazy" src="/images/hinge-face-frame.jpg#center"></p>
<p>You can tell them apart very easily - I actually think 99% of you could now tell them apart without any issues. You can probably also tell that the hinge from my original picture is very clearly the latter one, as there is no place to mount the cup to in the cabinet face frame.</p>
<p>Our PhD-level expert LLM friends? Not so much.</p>
<h2 id="getting-unhinged">Getting unhinged</h2>
<p>Let&rsquo;s start with Gemini. Even though I prompted it in English, it started thinking and responding in Czech, so I can&rsquo;t quote it verbatim, but here&rsquo;s a faithful translation from Google Translate:</p>
<blockquote>
<p>The specific type in the photo appears to be: A cup hinge with a cup diameter of 35mm (this is the standard size, but it&rsquo;s a good idea to check).</p></blockquote>
<p>OK, that&rsquo;s clearly wrong.</p>
<p>Let&rsquo;s move on to Claude Opus 4.5, with excellent visual reasoning:</p>
<blockquote>
<p>This is a Hettich concealed cabinet hinge (also called a European-style or cup hinge).</p></blockquote>
<p>Not really! But wait, Claude is searching the web, maybe it can recover:</p>
<blockquote>
<p>Based on my search, this is a Hettich Intermat concealed cabinet hinge (also known as a &ldquo;miskový závěs&rdquo; or cup hinge in Czech). It features a 35mm cup diameter and includes a clip-on mounting plate for easy installation.</p></blockquote>
<p>Not quite either.</p>
<p>Maybe ChatGPT with GPT-5 will save the day?</p>
<blockquote>
<p>Based on your photo, this appears to be a “European-style concealed cabinet hinge” — but specifically a variant that is face-frame / surface-mounted rather than the hidden “cup-in-door” type often used in frameless cabinets.</p></blockquote>
<p>Alright, that&rsquo;s much better! It seems that GPT-5 can reason more accurately over images. So what is its recommendation?</p>
<blockquote>
<p>My Recommendation<br>
Given your photo and situation, I’d start by ordering a clip-on concealed hinge for face-frame cabinets with a 35 mm cup — either basic or soft-close depending on whether you care about silent closing. That has the highest chance of matching the old hinge and mounting method.</p></blockquote>
<p>Somehow, in the stream of tips and examples, GPT-5 convinced itself that this was after all a hinge with a cup, and ended up giving me the wrong recommendation.</p>
<p>I honestly do not know why all the models faceplanted so badly. It does seem that my type of hinges is relatively rare, but at least GPT-5 identified it correctly at first.</p>
<h2 id="just-ask-a-real-expert">Just ask a real expert</h2>
<p>Turns out, the AI labs aren&rsquo;t optimizing for mundane DIY realities, and so you&rsquo;d be better off asking literally anyone at your local hardware store or DIY shop. Even the SOTA LLMs are not there in terms of giving you reliable information for this category of problems. They fell short of even identifying my hinge, much less finding a site where I could buy a replacement.</p>
<h2 id="the-smallest-of-wins">The smallest of wins</h2>
<p>There is a silver lining. When I confronted the LLMs with the fact that the hinge in the picture clearly does not have a cup, all three recovered and provided some genuinely useful information. They helped me nail down the terminology, and pointed me toward several e-shops I could browse to look for what I needed.</p>
<p>However, none of them were able to identify a specific product I could actually order. And perhaps more importantly, they only became helpful after I called them out on their mistakes. If I had trusted the initial responses (or if this had been a topic I knew even less about), I could easily have ended up ordering the wrong part.</p>
<p>An assistant you have to constantly fact-check and correct is of questionable value. For now, these expensive statistical behemoths remain more useful as a second opinion than a first resort for this type of problems.</p>
]]></content:encoded></item></channel></rss>