Building a Multi-Protocol NFT Gallery for a Bitcoin Wallet
Bitcoin and Stacks have multiple, completely different standards for digital collectibles. Ordinal Inscriptions live on Bitcoin’s base layer. SIP-9 NFTs are smart contract tokens on Stacks. Bitcoin Stamps use a different encoding method entirely. BNS Names are Stacks’ naming system. Each has different APIs, different metadata formats, and different rendering requirements.
We needed to build a gallery that treats them all as first-class citizens — displaying images, audio, video, SVGs, and even 3D models — while keeping the codebase maintainable. I picked up this work as part of the mobile app build-out, and it became one of the biggest feature areas I worked on.
I’d been around crypto long enough to have a general understanding of NFTs — I’d seen the ERC-721 standard from the Ethereum side and understood the basic mechanics of minting and transferring tokens. But Ordinal Inscriptions were new to me. The concept of inscribing data directly onto individual satoshis, using Bitcoin’s witness data, felt genuinely different from the smart-contract-based NFTs I was familiar with. There’s no contract to interact with, no token ID in the traditional sense — an inscription is identified by its position in the blockchain, tied to a specific sat.
My mental model going in was that we’d have a single “NFT” abstraction and each protocol would slot into it. That turned out to be naive. The protocols differ not just in their APIs but in their fundamental data models. An Ordinal inscription is content inscribed on a satoshi. A SIP-9 NFT is a smart contract token with an owner and metadata URI. A Bitcoin Stamp encodes image data in transaction outputs using a completely different method. Trying to force these into one unified model early would have papered over differences that actually matter to users — like sat rarity for Ordinals, or collection traits for SIP-9 tokens.
The Approach
Protocol-Specific Data Layers
Rather than trying to normalise everything into one shape early, we kept each protocol’s data layer separate. Each has its own service, its own query hooks, and its own type definitions. This meant adding Stamps support (#1689) didn’t touch the Ordinals code, and integrating the Gamma API for richer SIP-9 metadata (#1635) didn’t affect anything else.
The key insight was that normalisation should happen at the UI layer, not the data layer. Each protocol maps to a shared CollectibleView type that the gallery components consume.
I arrived at this through the usual route: trying the wrong thing first. Early on, I spent time sketching out a unified Collectible type that would hold all the fields from every protocol. It quickly became unwieldy — optional fields everywhere, type guards scattered through the rendering code, and a growing sense that the abstraction was fighting the domain rather than serving it. An Ordinal inscription doesn’t have “traits” in the way a SIP-9 NFT does. A BNS name doesn’t have “content” the way an inscription does. The union type was technically correct but practically useless.
The turn came when I started building the detail pages. Each protocol needed to display fundamentally different information, and the unified type meant every detail page was full of conditional rendering and null checks. It was simpler and clearer to let each protocol keep its own shape and only converge at the gallery level, where all you really need is an image, a title and a type indicator. The CollectibleView type captures exactly that — the minimum shared surface for rendering a grid of collectibles — while each protocol retains its full richness for detail views.
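A minimal sketch of this shape, assuming illustrative field names (the actual wallet types are not shown here): each protocol keeps its own rich type and supplies a mapper down to the shared gallery-level view.

```typescript
// Hypothetical sketch — names are illustrative, not the wallet's real types.
type CollectibleProtocol = "ordinal" | "sip9" | "stamp" | "bns";

// The minimum shared surface the gallery grid needs.
interface CollectibleView {
  id: string;                  // protocol-scoped unique key
  protocol: CollectibleProtocol;
  title: string;               // e.g. "Inscription #1234" or a collection name
  imageSrc?: string;           // resolved preview URL, if one exists
}

// Each protocol retains its full richness in its own type...
interface OrdinalInscription {
  inscriptionId: string;
  inscriptionNumber: number;
  contentType: string;         // used by detail views, not the grid
  previewUrl?: string;
}

// ...and only converges at the gallery boundary via a mapper.
function ordinalToView(o: OrdinalInscription): CollectibleView {
  return {
    id: o.inscriptionId,
    protocol: "ordinal",
    title: `Inscription #${o.inscriptionNumber}`,
    imageSrc: o.previewUrl,
  };
}
```

The point of the mapper is direction: data flows from rich protocol types toward the narrow view, never the other way, so detail pages still see the full protocol-specific shape.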
Multi-Source Data Merging
For SIP-9 NFTs, we integrated two separate APIs — Hiro and Gamma — and merged their data to get the best coverage. The team already had the Hiro integration; I added the Gamma layer and the merge strategy. Gamma provides richer metadata (collection info, rarity), while Hiro has broader token coverage. The merge strategy prioritises Gamma data when available, falling back to Hiro.
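A sketch of that merge under assumed types and field names (not the actual API shapes): seed with Hiro for coverage, then overlay Gamma's richer fields where they exist.

```typescript
// Hypothetical record shape — field names are assumptions for illustration.
interface Sip9Record {
  assetId: string;             // contract identifier + token id
  name?: string;
  imageUrl?: string;
  collectionName?: string;     // the kind of richness Gamma adds
}

function mergeSip9(hiro: Sip9Record[], gamma: Sip9Record[]): Sip9Record[] {
  const byId = new Map<string, Sip9Record>();
  // Hiro has the broadest token coverage, so seed the map with it...
  for (const token of hiro) byId.set(token.assetId, token);
  // ...then let Gamma's metadata win field-by-field where present.
  for (const token of gamma) {
    const base = byId.get(token.assetId) ?? token;
    byId.set(token.assetId, {
      ...base,
      name: token.name ?? base.name,
      imageUrl: token.imageUrl ?? base.imageUrl,
      collectionName: token.collectionName ?? base.collectionName,
    });
  }
  return [...byId.values()];
}
```

Merging field-by-field rather than record-by-record matters: a Gamma record with a collection name but no image shouldn't clobber a Hiro image that already works.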
Rich Media Handling
Ordinal inscriptions can be anything — images, audio, video, SVG, HTML, even 3D GLTF models. Each format needs different rendering:
- Images: Standard `<Image>` component with IPFS gateway resolution and fallbacks
- Audio: Audio player with waveform display

- Video: Video player with auto-generated thumbnail previews (#1756) — the app plays the video briefly and captures a frame
- SVG: Rendered inline with sandboxing
- 3D Models: GLTF viewer component
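The dispatch that picks a renderer can be sketched as a function of the inscription's MIME type; this is a hypothetical version (the real component set and its names aren't shown here). One subtlety: SVG has to be matched before the generic `image/*` prefix, since `image/svg+xml` needs sandboxing rather than a plain image tag.

```typescript
// Hypothetical renderer dispatch — kinds map to the components listed above.
type RendererKind = "image" | "audio" | "video" | "svg" | "model" | "fallback";

function rendererFor(contentType: string): RendererKind {
  // SVG must be checked before the generic image/* prefix match,
  // because it renders inline with sandboxing, not as a plain <Image>.
  if (contentType === "image/svg+xml") return "svg";
  if (contentType.startsWith("image/")) return "image";
  if (contentType.startsWith("audio/")) return "audio";
  if (contentType.startsWith("video/")) return "video";
  if (contentType === "model/gltf+json" || contentType === "model/gltf-binary")
    return "model";
  return "fallback"; // HTML, plain text, and anything exotic
}
```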
Video Thumbnails: A Fun Problem
The video thumbnail generation (#1756) was a satisfying solve. Rather than requiring a separate thumbnail service, the approach uses client-side frame capture that plays the video momentarily and screenshots it. The entire solution was a single file change (+213/-111 lines). The PR has a video demo showing it in action.
The obvious alternative was server-side thumbnail generation — run a service that fetches inscription videos, extracts a frame with FFmpeg, and caches the resulting image. That would have been more reliable in some ways, but it meant standing up and maintaining infrastructure for what was ultimately a presentation concern. The wallet is a client-side application; adding a server dependency for thumbnails felt like the wrong trade-off, especially when the videos were already being loaded on the client anyway.
The client-side approach won because it required zero infrastructure. The component loads the video in a hidden player, seeks to a short offset (a fraction of a second in), captures the current frame to a canvas, and converts it to a data URL. It’s not perfect — some video formats don’t seek cleanly, and there’s a brief flicker if the video loads slowly — but for the gallery view it’s more than good enough. The main edge case was videos with a black or empty first frame, which we handled by seeking slightly further in rather than capturing frame zero. MP4 and WebM worked reliably; more exotic formats inscribed on-chain were rare enough that we handled them with a generic fallback thumbnail.
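The capture flow described above can be sketched roughly like this, using standard browser APIs. Function and variable names are my own, not the PR's; the seek-offset helper encodes the "skip frame zero" fix for videos with a black first frame.

```typescript
// Avoid capturing frame zero (often black/empty) by seeking slightly in,
// clamped for very short videos. Offset value is an illustrative default.
function chooseSeekOffset(durationSec: number, preferred = 0.5): number {
  if (!Number.isFinite(durationSec) || durationSec <= 0) return 0;
  return Math.min(preferred, durationSec / 2);
}

// Hypothetical sketch of client-side frame capture: load the video in a
// hidden element, seek past frame zero, draw the frame to a canvas, and
// return it as a data URL.
function captureFrame(src: string): Promise<string> {
  return new Promise((resolve, reject) => {
    const video = document.createElement("video");
    video.crossOrigin = "anonymous"; // required to read pixels cross-origin
    video.muted = true;
    video.preload = "auto";
    video.src = src;
    video.addEventListener("loadedmetadata", () => {
      video.currentTime = chooseSeekOffset(video.duration);
    });
    video.addEventListener("seeked", () => {
      const canvas = document.createElement("canvas");
      canvas.width = video.videoWidth;
      canvas.height = video.videoHeight;
      const ctx = canvas.getContext("2d");
      if (!ctx) return reject(new Error("no 2d canvas context"));
      ctx.drawImage(video, 0, 0);
      resolve(canvas.toDataURL("image/jpeg", 0.8));
    });
    video.addEventListener("error", () =>
      reject(new Error("video failed to load")),
    );
  });
}
```

Note the `crossOrigin` attribute: without it, drawing a remote video to a canvas taints the canvas and `toDataURL` throws, so the inscription host must serve permissive CORS headers for this to work.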
Detail Pages Per Protocol
We split collectible detail views by protocol (#1642, #1636) because each protocol has different metadata worth displaying. An Ordinal inscription shows its inscription number, content type, and sat rarity. A SIP-9 NFT shows its contract, collection, and traits. Trying to force these into one detail page would have been a mess.
From Mobile to Extension
After building this for mobile (Sep-Nov 2025), I extracted the shared logic into packages/features (#1981) and worked on bringing it to the browser extension (#1837, #2067). Other team members contributed to the extension integration too — the queries and API layers that powered all of this were already in place thanks to earlier work by the team. The CollectibleView abstraction meant the same data logic powered both platforms, with only the rendering differing.
The “go live” moment was #2067 — shipping the new extension home tab with a Collectibles tab alongside Assets and Activity.
The Result
The gallery handles four distinct blockchain protocols, multiple media formats, and two data sources, while the UI code stays clean because each protocol handles its own complexity behind a shared interface.
Key PRs
| PR | What | Stats | Video? |
|---|---|---|---|
| #1689 | Stamps service integration | +2,110/-1,133, 18 files | |
| #1635 | Gamma API for SIP-9 | +409/-315, 15 files | |
| #1636 | Collectibles UI overhaul | +537/-464, 30 files | |
| #1642 | Audio, video, SVG formats | +817/-95, 27 files | |
| #1756 | Video thumbnail generation | +213/-111, 1 file | YES |
| #1981 | Shared CollectibleView utilities | +973/-111, 33 files | |
| #1837 | Cross-platform activity + NFTs tab | +4,796/-1,678, 152 files | YES |
| #2067 | Go live: extension home tab | +284/-99, 19 files | |