| Beyond Top Down? A discursive (that is to say long and rambling) essay remembering the author's primitive attempts to program his computer to write poems, and asking to what extent AI can do better. Probably of interest to nerds only. ![]() Grok imagines Dr Leavis being confronted by a robot. Sometime back in the seventies the philosopher Dorothy Emmett, (who had been my tutor briefly at Manchester) met the great, and often abrasive, literary critic F.R. Leavis at a party. Making conversation, she asked: ‘Did you know that there is now a computer that can write a poem?’ I’m sure she relayed the news enthusiastically. (She could be very enthusiastic, and was even enthusiastic about one of my essays once.) Leavis was unimpressed, and returned to this conversation in several of his late writings, declaring that anyone who thought a computer could write a poem could have no idea what a poem is. He valued poems that were the deepest and subtlest expression of human thought and feelings; to suggest that a mere machine could create one was a kind of blasphemy. There is much to be said for
Leavis’s attitude, but there is definitely another side
to the question. Yes, yes, a poem can be ‘the precious
lifeblood of a master spirit, embalmed and treasured up
on purpose to a life beyond life’, but a poem is also an
arrangement of symbols. Computers were invented to
organise and arrange symbols – so why can’t they write a
poem? It’s a question that strongly intrigued me at one period in my life. When I first owned a computer (a Sinclair Spectrum, back in 1982) I learned the basics of programming, and became interested in the way that a simple program, given a vocabulary of words whose grammatical function was defined, could generate English sentences. I enjoyed the nonsense sentences it produced: ‘The greengrocer strangled the cold fish slowly.’; ‘A rancid aunt likes bananas.’ I would set the machine to print out random sentences. Then it printed one that brought me up short: ‘My knife imagined a death.’ This sentence had a resonance
missing from the other, sillier products of the program.
It almost had a poetic feel to it, you might say. Which set me thinking about why
it sounded poetic to me. Was it the dramatic
subject-matter? Was it the personification of the knife?
Was it the apparent compression of so large an idea into
five words? Or was it just that it seemed to fit into a
classification of sentences that might sound poets?
My homage to this appeared in
Snakeskin – The Collaborative
Sonnets. In these poems lines by Linda
Crespi and William Shakespeare are randomly interleaved
to produce alternative versions of the poem. (This was
written in JavaScript, a language which has evolved
since then. Like a number of my other JavaScript works,
the sonnets still work on some browsers, but not on
many. Things change.)
Here’s a typical example:
If you want to read
thirty-three more examples of the program's versifying,
a pamphletful can be found at: https://www.snakeskinpoetry.co.uk/147gothp.pdf So is this computer-fashioned thing a poem? If I took it along to a poetry workshop, concealing its origins, I’d expect it to be challenged in all sorts of ways, about its quality, its attitude and its limitations, but I don’t think it would be automatically rejected as machine-made. And if it’s a poem, who is the poet?
That’s almost the limit of
where I got to with programming poetry. I had vague
plans for a much bigger program that would somehow adapt
Vladimir Propp’s ideas about the morphology of the
folk-tale to generate stories that could then be
expressed in verse. I think I made a small start on
it, but other projects got in the way. I also
considered the problem of programming rhyme; not
impossible, but one would need to make a large
word-hoard available. Just as my sentence programs
had got the grammar right according to the rules, but
with random disregard of what the words might actually
mean, so in the poetry programs I had dealt moderately
well with the syntax of poetry, but not at all with the
semantics. In other words, the program arranged the
phrases nicely, but with no regard to what they meant.
It was top-down writing, with no notion connection of a
reality rooted in experience.
The verse is clunky, and the idea of what a poem is has not got far past the Hallmark cards stage of sophistication. What it does have is a good vocabulary; it has looked up th idea of triggering, and has produced an impressive list of synonymous phrases, arranged in more-or-less iambic tetrameters, and rhyming in couplets. The thing disintegrates rather in the third stanza. The metaphorical trigger now seems to be on an actual gum, and the punctuation gives one no confidence that the poem knows what it’s on about (why those commas after ‘felt’ and ‘know’?) One gets the idea of the
program fishing for things it could say, and striving to
say them in a way that could be taken for a poem.
Large language Model programs
use reasoning to try to work out what is true. When we
ask them question, they often come up with accurate
answers. Sometimes, though, they can seem like the
student who has done all the reading, but has missed the
point. They scour large numbers of documents to come up
with an answer, but know nothing outside those
documents. They therefore reflect the attitudes of those
documents. They can’t venture outside the world created
by the language of the documents. Except that on
occasion, it seems, they can be tempted; lawyers ask AI
to help them make a case, and sometimes, too helpful,
the program creates references to case law that would be
very helpful if it were not fictitious. Seeing that the
genre of legal argument needs to have cases as proof, it
invents, or hallucinates them. In this kind of instance,
AI is like the rather too helpful person from whom you
ask directions. He doesn’t want do seem ignorant or
unhelpful, so he says something that might be true. ‘Go
up to the church and turn left,’ he says, in a helpful
voice. So off you go in entirely the wrong direction. For an example of faking, let’s see what Grok makes of the ‘Write a poem about trigger warnings’ prompt, two years after ChatGPT came up with the Hallmark-style rhyming:
This is a much more
sophisticated poem. It is in the chopped-up-prose style
of free verse that is the norm in Poetry Review
and such places. It has significant-sounding phrases
like ‘memories wearing someone else’s skin’. (I’m not
quite sure what that means, but I can imagine things
that it might mean, and they are more or less relevant.)
It has a rather drab catalogue of things that might be
triggers, but they climax with a good one, the mirror,
which does suggest a universal human experience. Then
the last line gives us a most effective sensual reminder
of petrichor, that special smell of rain on hot ground.
This AI model has grasped that a personal sensation is a
good way to end a poem. An internet search tells me that
there are plenty of contemporary poems that refer to
this. Grok has selected well, and has used something
that others have told him humans respond to strongly.
But he is proving it on what he’s been told about other
people’s pulses, not his own. This is a reasonably
thought-out poem, but thought is divorced from
feeling. If it reads like a poem - well, that's in
the eye of the reader again. It reads just like a huge
number of other inoffensive twenty-first century poems.
It's very easy to accept. To return to F R Leavis, that challenging man: he had a normative idea of poetry – what poetry ought to be. Not much came up to his standards. He notoriously dismissed much of modern poetry, and large swathes of the traditional canon, for not coming up to the mark. He praised poets like Keats and Blake whose thought was inextricable from feeling, and for whom poetic form was not an optional extra, but was the only way in which such a union could be expressed. He was critical of poets who allowed their writing to become mechanical, divorced from their keenest intelligence and feelings. He felt that Milton had been a bad influence of poets for two centuries because he often used his immensely effective verse as a way of conveying existing ideas, rather than deeply engaging with them. ‘‘A good deal of Paradise Lost strikes me as mechanical as bricklaying.’ Perhaps one or two poets in an age come right up to his standards, and he could always change his mind and deem them insufficient, as he finally seems to have done with Eliot.
Grok pictures (rather
romantically) John Keats uniting thought and sensation
when he bites into a peach. My QuickBasic programs were deliberately mechanical; they wanted to see how far mechanical poetising could take one. Today’s AI blurs the picture. It gives the illusion of being bottom-up, arising from experience. But it isn’t, and it can’t be. It is made of text, made of other people’s ideas. It is faking. As a writer of light verse, I often write parodies, pastiches and homages that are to some degree imitations. Is machine poetry on a different level from that. And does that matter? I
would argue that the big task of literary criticism in
the coming years will be to discriminate between the
fake and the real. Or, putting it another way, to
redefine what human writing is. A hundred and fifty
years ago the invention of photography posed a
challenge to artists and critics. Much of painting
since the Renaissance had been devoted to the precise
rendition of detail. Now photography could do that
better. The art history of the past century and a half
has been the history of artists responding to this
challenge in different ways. There is an intellectual
and artistic battle, and it's not over yet. Which
is why I'm very interested in Robin Helweg-Larsen's
experiment of offering AI his own poems to feed on.
The seed of his own poetry has been enough to raise
the game of the AI program considerably. And a game it
is - which we'll be playing for the next century,
whether we like it or not. ![]() George Simmers imagined as a cyborg made mostly of glass. If you've actually reached he end of this over-long essay, and have any thoughts about it, George Simmers would be pleased to hear them. |