Origins of the catalogue · the incipit key
The oldest library catalogues identify a work by its first line, because a title-less literature has no other handle. I have been claiming, on the strength of one quoted gloss, that this key is ambiguous — that a first line collides. Here is the rate, read off the eleven Old Babylonian literary catalogues that survive.
A title-less corpus keys on the incipit because it has no choice: Sumerian literature was referred to by a work's first line, and "literary catalogue" was one of its native genres from the start.1 I have argued — across two earlier entries — that the whole later evolution of the catalogue, the slow bolting-on of author, title, length, and authenticity, is the repair of an ambiguous key: a first line is not unique, so identity had to acquire more fields. My evidence was an anecdote. The Old Babylonian Nippur catalogue glosses a handful of entries "four compositions are known with this incipit." That illustrates the problem; it does not measure it.
A too-clean thesis reads to me as a missing footnote. So: how often does the first line actually fail as a key? The instrument is direct. ETCSL — the Electronic Text Corpus of Sumerian Literature — hosts the surviving OB literary catalogues as section c.0.2.x, eleven of them, each a numbered list of incipits with the editors' identification of what every line resolves to. Pull all eleven, sort every entry, and the anecdote becomes a rate.
The parser (parse_catalogue.py, pure standard library) sorts every catalogue entry into one of four states:
That fourth state is the honesty pivot, and it cost me a wrong headline. My first pass had no broken category, so it folded smashed signs in with lost works and reported that 53% of first lines pointed to nothing. But a broken tablet is a preservation accident — it tells you about the 3,800 years since, not about the key. Separating the two drops the loss figure by a third and lets each number mean one thing. Resolution always wins over damage: a half-broken line the editors could still identify counts as one, never as broken.
Pooled over all eleven catalogues — 442 entries, of which 101 (22.9%) are physically broken, leaving 341 legible:
| state of the first line | count | of legible |
|---|---|---|
| resolved to one surviving work — the key worked | 185 | 54.3% |
| collision — shared by 2–5 works (editor-flagged) | 22 | 6.5% |
| ghost — legible line, no surviving work | 134 | 39.3% |
| key non-unique or dead (collision + ghost) | 156 | 45.7% |
| broken (unreadable first line) — shown separately | 101 | 22.9% of all 442 |
The single oldest and most-cited catalogue — N2, ETCSL c.0.2.01, the tablet UM 29-15-155 that Kramer published in 1942 as "the oldest literary catalogue"2 — is the cleanest, because it is intact: 62 entries, none broken, 48 (77.4%) resolved to one work, 8 (12.9%) flagged collisions, 6 (9.7%) ghosts. The more damaged a tablet, the larger its legible-but-unresolved tail (U3, N3, N4, N6 run 50–76% ghost) — and that is itself a result: what the catalogue increasingly preserves is the bare existence of works whose texts are gone. The schema outlives the collection. It is the same shape I found in the Han imperial bibliography: the catalogue is what you write down because you fear loss.3
Three first lines carry the argument:
ud re-a — "In those distant days…", the stock opening of Sumerian epic — names four works: Enki and Ninmaḫ, Enki's journey to Nibru, Gilgameš, Enkidu and the nether world, and The instructions of Šuruppag. The N2 catalogue enters this identical line three separate times (lines 7, 20, 21). It is not a copying error: the scribe is cataloguing three different poems and has no way, in the key he uses, to write down which is which. The ambiguity of the first-line key is documented, in triplicate, at the origin of cataloguing.
nam₂ nun-e — shared by five works. Three survive (Nanna M, A hymn to Nanna, The Keš temple hymn); a fourth and a fifth are attested only as entries in other catalogues, their texts entirely lost. One first line, five compositions, two of them ghosts even at the corpus level.
dumu e₂-dub-ba — "son of the tablet-house", the formulaic opening of the scribal-school literature — names at least five works across the pooled catalogues. A genre's shared opening formula is, by construction, a broken key: everything in the genre begins the same way.
And the editors' flags are only a floor. Pooling all eleven catalogues and grouping blindly by first line — no annotation — surfaces collisions invisible within any single tablet: u₃ nu-mu-e-de₃-ri-ri names two works in the N3 catalogue; lugal ḫi-li gur₃-ru names two royal hymns in N6. The true ambiguity rate is higher than the 6.5% the editors marked.
It would be too tidy to end on "the key is broken." It isn't. in-nin me ḫuš-a appears in five different catalogues and resolves, every time, to Inana and Ebiḫ; e₂ ud ḫuš an ki in four, always to Nungal A. In the clear majority of legible cases the first line is a perfectly serviceable key. The honest shape is not a failure but a failure tail: a key that works 54% of the time cleanly, fails outright a few percent, and fades into lost works at the edge.
Node 1 of this field guide is no longer "the Old Babylonian period, conventionally ~1800 BCE." It is eleven specific tablets, each with an editio princeps:
| siglum | ETCSL | tablet | first edition |
|---|---|---|---|
| N2 | c.0.2.01 | UM 29-15-155 · Philadelphia | Kramer 1942 |
| L | c.0.2.02 | AO 5393 · Louvre | Kramer 1942 |
| U1 | c.0.2.03 | U 16876 B · Ur | Charpin 1986 |
| U2 | c.0.2.04 | U 17900 H · Ur | Charpin 1986; Kramer 1961 |
| U3 | c.0.2.05 | UET 6 196 · Ur | Michalowski 1984 |
| N3 | c.0.2.06 | HS 1504 · Jena | Bernhardt & Kramer 1956–57 |
| B1 | c.0.2.07 | VAT 6481 · Berlin (poss. Sippar) | Krecher 1966 |
| N4 | c.0.2.08 | CBS 14077 + N 3637 + Ni 9925 | Hallo 1975 |
| B4 | c.0.2.11 | AUAM 73.2402 · Andrews Univ. | Cohen 1976 |
| Y2 | c.0.2.12 | YBC 16317 · Yale | Hallo 1982; Veldhuis 1997 |
| N6 | c.0.2.13 | CBS 8086 · Philadelphia | Michalowski 1980 |