Why Your JSON Has a BOM (Byte Order Mark) and How to Strip It
It was 11pm on a Friday and our nightly ETL was failing on a customer file that had worked yesterday. Same vendor, same format, same column headers. Parser kept choking at "position 0." Five engineers staring at it. Nothing in the file looked wrong. Three of us re-downloaded the file thinking the upload corrupted something. It hadn't.
Around midnight someone ran head -c 3 file.json | od -An -c and got back 357 273 277. That's a UTF-8 BOM in octal. Some PowerShell script in the customer's pipeline had quietly added it, and our one-line JSON.parse couldn't handle it. We've kept that command in our debugging cheat sheet ever since. This post is what we wish someone had told us before that night.
What's a BOM, actually?
A Byte Order Mark (BOM) is the Unicode code point U+FEFF placed at the very start of a file. It tells a text reader two things: which UTF encoding the file uses (UTF-8, UTF-16, UTF-32), and which byte order. For UTF-16, it actually matters. For UTF-8, byte order is meaningless: bytes are bytes, and the BOM is purely a "this is UTF-8" flag.
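You can see this distinction directly in Node by encoding U+FEFF both ways, a quick sketch:

```javascript
// The BOM is a single code point, U+FEFF; its byte form depends on the encoding.
const bom = '\uFEFF'

// UTF-8: always the same three bytes, so "byte order" is meaningless here
console.log(Buffer.from(bom, 'utf8').toString('hex'))    // "efbbbf"

// UTF-16LE: two bytes whose order is the whole point of the mark
console.log(Buffer.from(bom, 'utf16le').toString('hex')) // "fffe"
```

Those three UTF-8 bytes, EF BB BF, are what every detection trick below is looking for.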
The Unicode standard says UTF-8 BOMs are allowed but not required. Some tools add one anyway:
| Tool | Adds UTF-8 BOM? |
|---|---|
| Notepad (classic, before Windows 10 1903) | Yes, by default |
| Excel "Save as CSV (UTF-8)" | Yes |
| PowerShell Out-File -Encoding utf8 (Windows PS 5.1) | Yes |
| PowerShell Out-File -Encoding utf8 (PS 7+) | No (default changed in PS 6) |
| VS Code | Configurable; default no |
| Linux cat, vim, nano | No |
| Most programming languages writing UTF-8 | No |
JSON's spec (RFC 8259, section 8.1) is unambiguous: implementations MUST NOT add a BOM when writing, though a parser MAY ignore a leading BOM rather than treat it as an error. Many parsers do accept it, but JSON.parse in JavaScript does not:
JSON.parse('\uFEFF{"name":"hello"}')
// SyntaxError: Unexpected token in JSON at position 0
The position is 0 because the BOM is character 0: the error points at a character you can't see. That's the bug.
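When you suspect an invisible character, inspect the code point rather than the rendered string. A quick sketch:

```javascript
// A JSON string with an invisible BOM at position 0
const suspicious = '\uFEFF{"name":"hello"}'

// The BOM renders as nothing, but the code point gives it away
console.log(suspicious.codePointAt(0).toString(16)) // "feff"

// Slicing off that one character is enough to make the parse succeed
console.log(JSON.parse(suspicious.slice(1)).name)   // "hello"
```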
Detect a BOM
Three reliable ways:
// In JavaScript (Node), after reading the file
import fs from 'node:fs/promises'
const text = await fs.readFile('data.json', 'utf-8')
const hasBom = text.charCodeAt(0) === 0xFEFF
# In a shell — check for the EF BB BF sequence
head -c 3 data.json | od -An -c
# If you see 357 273 277 (octal for EF BB BF), that's a BOM.
# Even simpler with file(1)
file data.json
# data.json: UTF-8 Unicode (with BOM) text
Strip a BOM
The cleanest fix is at the read site, since you can't always control the writer.
JavaScript / Node
function stripBom(text) {
return text.charCodeAt(0) === 0xFEFF ? text.slice(1) : text
}
const json = JSON.parse(stripBom(text))
There's also a tiny package called strip-bom that does exactly this. It's 4 lines of code; pick a side on whether that's a dependency or a copy-paste.
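If your data arrives as a Buffer (uploads, streams) rather than a string, the same idea works at the byte level; a sketch, with stripBomBuffer as a hypothetical helper:

```javascript
// Drop a leading UTF-8 BOM (EF BB BF) from a Buffer before decoding.
function stripBomBuffer(buf) {
  const hasBom =
    buf.length >= 3 && buf[0] === 0xEF && buf[1] === 0xBB && buf[2] === 0xBF
  return hasBom ? buf.subarray(3) : buf
}

// BOM bytes followed by '{"ok":true}'
const raw = Buffer.concat([
  Buffer.from([0xEF, 0xBB, 0xBF]),
  Buffer.from('{"ok":true}'),
])
console.log(JSON.parse(stripBomBuffer(raw).toString('utf-8')).ok) // true
```

Note that Node's utf-8 decoding does not strip the BOM for you; without the helper, the parse above fails exactly as described earlier.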
Python
import json
with open('data.json', encoding='utf-8-sig') as f:
data = json.load(f)
The utf-8-sig codec is the trick: it reads UTF-8 but transparently strips a leading BOM. There's no equivalent "strip BOM" flag in json itself; it's the codec that handles it.
PowerShell (modern)
$content = Get-Content data.json -Encoding utf8NoBOM -Raw
$json = $content | ConvertFrom-Json
Note utf8NoBOM is PS 6+ only. On Windows PowerShell 5.1, use [System.IO.File]::ReadAllText('data.json'), which detects a leading BOM and strips it for you.
jq and sed
Sometimes you just want the BOM gone in a file:
sed -i '1s/^\xef\xbb\xbf//' data.json
Or, to pipe through jq after stripping:
sed 's/^\xef\xbb\xbf//' data.json | jq .
The \x escapes are a GNU sed feature; on macOS/BSD sed they won't match, so use perl -pe 's/^\xEF\xBB\xBF//' there instead.
Online, when you don't want to install anything
Paste the content into a JSON Formatter; pasting strips the BOM by virtue of the browser only copying the visible text. That's a slightly accidental fix, but it works for one-off cases.
Prevent BOMs at the source
Stripping at read is defensive; the better fix is to stop the BOM from being written in the first place.
- Excel: when saving as CSV, choose "CSV UTF-8 (Comma delimited)", but be aware Excel still writes a BOM. To avoid it, save as plain CSV and convert externally, or use CSV-to-JSON tools that handle BOM input gracefully.
- PowerShell: prefer PS 7+, or use [System.IO.File]::WriteAllText($path, $text, [System.Text.UTF8Encoding]::new($false)); that $false is the BOM flag.
- VS Code: bottom-right encoding selector → "Save with Encoding" → UTF-8 (without BOM). The default is no-BOM unless you've configured otherwise.
- Build pipelines: validate. Most CI lint setups can include a BOM check in pre-commit.
A simple shell check that fails the build if any tracked file has a BOM:
git ls-files '*.json' '*.ts' '*.tsx' | while read -r f; do
  if [ "$(head -c 3 "$f" | od -An -c | tr -d ' ')" = "357273277" ]; then
    echo "BOM found in $f"
    exit 1  # exits the while's subshell; keep this pipeline as the script's last command so the status propagates
  fi
done
Drop it in .husky/pre-commit or a CI step. Catches BOMs the day they enter the repo, not in production three weeks later.
Why does this still happen in 2026?
Because Excel still adds a BOM by default when exporting "CSV UTF-8", the most common path between non-engineers and JSON pipelines. Because Windows PowerShell 5.1 ships on every Windows machine and emits a BOM by default. And because the BOM is invisible in most editors, surfacing in cmd.exe only as stray characters.
The Unicode standard has long noted that a BOM is neither required nor recommended for UTF-8, but the existing tools haven't all caught up. Until they do, write defensive parsers.
Recommended workflow
- Reading external JSON in code: always strip a leading BOM before parsing. Cheap, defensive, no downsides.
- Reading external JSON ad hoc: paste into a JSON Formatter; it'll show you parse errors with context.
- Writing JSON: never add a BOM. Set your tools' defaults once.
- Pre-commit: validate that no tracked text file starts with EF BB BF.
The unsexy truth: BOMs are a cross-platform pollution problem, and the only defense is layered. Strip on read, never write, and check in CI. None of these is sufficient alone.
Related tools on DevTools Online:
- JSON Formatter: paste and format; BOM stripped automatically
- JSON Validator: finds malformed JSON beyond just the BOM case
- CSV to JSON: converts Excel exports correctly
- String Inspector: see invisible characters byte by byte