English (GB)
5879 ranked first names in the benchmark
- Top 100 (100)
- 16%
- Top 500 (500)
- 18.2%
- Top 1000 (1000)
- 19.7%
- Avg. variants among matches
- 1.29
Informational benchmark
Yes, many real names can be spelled with chemical elements, but most still cannot. In our country-level top-1000 first-name benchmarks, hit rates run from 15.7% in Dutch (NL) to 20.1% in German (DE), so the useful question is not just “can it work?” but “which kinds of popular names work most often, and why?”
To answer that, I benchmarked top-ranked first-name lists for Carbonat's GB, DE, ES, FR, IT, and NL datasets, plus a small Welsh fallback list, and asked three practical questions: how often top-100, top-500, and top-1000 names can be spelled with real element symbols; which countries keep the strongest hit rate as you go deeper into the ranking; and which popular names stay flexible enough to allow more than one valid split.
None of this terminology is technically grounded. I made these labels up for the sake of naming the patterns in a readable way.
A split is one valid way to break a name into element symbols, such as Simon becoming `Si` + `Mo` + `N`. A split pair is one adjacent joining inside that split, such as `Si` + `Mo` in the same example. Split patterns means the recurring symbol shapes I use to describe many names in the same dataset or language family, such as whether names tend to lean on one-letter symbols like `N` and `S`, vowel-heavy endings like `I` and `O`, or repeated adjacent pairs like `I` + `N` and `C` + `H`. Hit rate is simply the share of names in a benchmark that parse successfully, while variants counts how many different valid element spellings a single name has.
Quick answer
Last updated March 31, 2026
Popular names do work with element symbols often enough to be interesting, but not often enough to assume your favorite name will parse cleanly. Romance benchmarks tend to be more flexible, German (DE) leads the country-level top-1000 hit rate, and in the country-level data every recorded failure ends up in one of two parser problems: the opening never matches a valid symbol, or a promising split dies later.
These numbers come from Carbonat's ranked first-name benchmark lists, not from a universal census of every real-world name in a language. For GB, DE, ES, FR, IT, and NL, the benchmark preserves country rank order from the same upstream source already used by the names feature. Welsh is a repository-local fallback list in ranked file order, so it is informative but not directly comparable to the larger country-derived benchmarks.
That is why the safest wording here is “in our GB benchmark list” or “in our Italian benchmark” rather than “in the world” or “in every Italian name.” The value of the analysis is that every benchmark is measured with the exact same element parser and transliteration rules, so the differences come from the names themselves rather than from changing methods.
Here, a benchmark means one ranked list of popular first names that I test with the same parser from top to bottom. When I call Welsh a fallback, I mean it is a smaller repository-local list rather than a country-scale ranked source like the GB, DE, ES, FR, IT, and NL datasets.
Broader word-level spelling statistics live on the language info page. This page is strictly about names.
German (DE) keeps the highest elementizable hit rate when you look across the full top 1000 names in our benchmark: 20.1%. That makes German the strongest country-level performer in this first pass.
Italian (IT) does not lead on raw hit rate, but its matched popular names are the most flexible, averaging 1.62 valid spellings among top-1000 matches. That is where names like Francesco become especially interesting.
German (DE) improves the most as you move from the top 100 names (16%) to the top 1000 (20.1%). In other words, German's popular-name compatibility gets stronger once you move past the absolute biggest household names.
Welsh shows a 100% hit rate in this artifact only because the fallback list is tiny: 11 ranked names, already curated for elementizability. On the other end, the Dutch benchmark drops from 18% in the top 100 to 15.7% in the top 1000, which makes it the strongest example of a long tail that becomes less parser-friendly.
| Benchmark | Top 100 | Top 500 | Top 1000 | Avg. variants |
|---|---|---|---|---|
| German (DE) | 16% | 19.4% | 20.1% | 1.47 |
| English (GB) | 16% | 18.2% | 19.7% | 1.29 |
| Spanish (ES) | 19% | 19% | 19.3% | 1.47 |
| French (FR) | 17% | 17.8% | 19.2% | 1.35 |
| Italian (IT) | 17% | 17.8% | 18.7% | 1.62 |
| Dutch (NL) | 18% | 16% | 15.7% | 1.37 |
5879 ranked first names in the benchmark
11 ranked first names in the benchmark
5799 ranked first names in the benchmark
5505 ranked first names in the benchmark
5494 ranked first names in the benchmark
5885 ranked first names in the benchmark
5803 ranked first names in the benchmark
The useful pattern is not that English fails one way and Italian another. Across all six country-level top-1000 benchmarks combined, every recorded failure in this dataset falls into two buckets. 51.9% of failures never get off the ground because the opening letters do not match any element symbol, while the remaining 48.1% start plausibly and then strand leftover letters that the periodic table cannot finish.
That makes this section more about parser geometry than about national quirks. The same two breakpoints keep reappearing whether the example is David, Jose, Marie, or Paul.
2529 of 4873 top-1000 failures across GB, DE, ES, FR, IT, and NL
These names fail immediately. Their first one to three letters never produce a valid element symbol, so the parser cannot even begin a split.
Fails immediately after normalization to david: none of d, da, dav is a valid opening element symbol.
Fails immediately after normalization to giuseppe: none of g, gi, giu is a valid opening element symbol.
Fails immediately after normalization to jan: none of j, ja, jan is a valid opening element symbol.
Fails immediately after normalization to jose: none of j, jo, jos is a valid opening element symbol.
2344 of 4873 top-1000 failures across GB, DE, ES, FR, IT, and NL
These names do begin with valid symbols, but every possible path eventually leaves a remainder that no element symbol can cover.
Gets partway through as P + Au, then stalls on l.
Gets partway through as P, then stalls on eter.
Gets partway through as Th + O, then stalls on mas.
Gets partway through as H, then stalls on ans.
The benchmark data lines up with the broader shape analysis I already measured in Carbonat's name datasets. Italian and Spanish still look vowel-heavy overall, with 83.9% and 66.3% of names ending in vowels in our larger dataset pass. That helps explain why those countries keep surfacing names like Francesco, Francisco, and Cristina as strong matches.
German leads the top-1000 hit rate not because every top name works, but because its compatibility stays resilient deeper into the benchmark. Dutch moves in the opposite direction, which suggests a long tail that becomes less element-friendly even though some very high-ranked names, such as Henk and Frank, still work early.
Accented spellings do not disappear from the analysis. Carbonat normalizes diacritics and a few common transliterations before testing whether a name can be written with elements, so names like Acuña and Iñaki remain searchable even though the parser ultimately works on normalized letter sequences.
One small pattern also shows up if you zoom in on the boundary letters inside split pairs. Across the current dataset, the most common joins are i + n, o + n, a + n, and c + h. In practice that means many successful boundaries end with a vowel-like symbol and then reopen on N or I, while clusters like ch often survive as adjacent symbols too. So the joins are not random: they reflect both the periodic-table inventory and the spelling habits of the names that make it through the parser.
Across our 7,568 Germanic names (`en`, `de`, `nl`), only 10.3% start with a vowel. The most common initials are B, S, and K.
The strongest symbols overall lean toward N, S, and K. The most common split pairs are I + N and C + H, which shows how often those consonant-heavy shapes stay adjacent.
That profile still produces useful splits, but it is less flexible overall at 1.44 variants per name on average.
In our 8,364 Romance names (`es`, `fr`, `it`), 13.1% start with a vowel and 66% end with one. The opening symbol is most often C.
The overall split inventory leans heavily on I and O. The most common split pairs are N + I and I + N, which fits the way vowel-friendly segments keep chaining together.
That helps explain why the family averages 1.61 variants per name and 37.6% alternative splits.
Welsh is harder to generalize because the current sample is tiny and curated, not a full country-ranked benchmark. In this fallback set of 21 names, starts cluster around P and B.
The opening symbols Be and Pr are unusually prominent, while pairs like Y + S show how different the tiny sample can look.
The safe read is still “interesting shape, limited evidence,” not a broad claim about all Welsh names.
en, de, nl · 7,568 names
es, fr, it · 8,364 names
cy · 21 names
I avoid saying “most common names in the world,” but there is still a useful cross-country angle: some names keep reappearing across several ranked country lists and still parse cleanly into chemical symbols. These are the names that travel best across our current benchmark set.
Appears in 6 benchmark lists; best rank #9
DE, ES, FR, GB, IT, NL
Appears in 6 benchmark lists; best rank #13
DE, ES, FR, GB, IT, NL
Appears in 6 benchmark lists; best rank #58
DE, ES, FR, GB, IT, NL
Appears in 6 benchmark lists; best rank #67
DE, ES, FR, GB, IT, NL
Appears in 6 benchmark lists; best rank #80
DE, ES, FR, GB, IT, NL
Appears in 6 benchmark lists; best rank #121
DE, ES, FR, GB, IT, NL
Yes, but they are rare. In the current elementizable-name dataset I only found 57 unique normalized palindromic names at all. They are constrained twice over: first, the letters have to mirror cleanly from left to right and right to left; second, that mirrored sequence still has to break into real element symbols rather than just looking like a word palindrome.
First names are slightly more common than surnames in this corner of the data, but they also skew shorter and more toy-like. I found 38 unique palindromic first names, with an average normalized length of 3.6, including Reinier, Nalan, and Natan. Surnames are a little rarer but a bit longer and more structural: 31 unique palindromic surnames, averaging 4, including Renner, Serres, and Staats.
The overlap is small too: only 12 normalized forms show up in both buckets, mostly very short shapes such as alla, bab, bob, esse, kik. That fits the general pattern: once names get longer, it becomes much harder to satisfy both palindrome symmetry and element-symbol segmentation at the same time.
38 unique normalized matches; avg. length 3.6
31 unique normalized matches; avg. length 4
Can you find any more? Try the names browser with your own hunches, or jump into the designer if you discover a palindrome that looks good as tiles.
If your goal is to test a real first name or surname, the fastest next step is the names browser, where you can search examples that already parse. If you already know which split you like, continue into the designer to customize and download it.
If you are more interested in the symbol inventory itself, the interactive periodic table shows the building blocks, while the word-level language statistics page compares broader word-level behavior beyond names.
If you discover a name, poem, or sentence that feels especially good as element tiles, you can now send it to the community submissions page so it can be reviewed and, once approved, added to the shared public corpus for other people to like and share.
And yes, in a mildly shameless but hopefully still charming way, a name that splits nicely can make a genuinely fun gift. If you end up with a layout you love, putting it on a t-shirt or another item is a fairly reliable route to a unique geek gift for someone who enjoys chemistry, wordplay, or both.