artificial intelligence – Page 2 – R&A IT Strategy & Architecture

The Department of “Engineering The Hell Out Of AI”

Posted by gctwnlon February 7, 2024August 7, 2024in AI7 Comments

ChatGPT has acquired the functionality of recognising an arithmetic question and reacting to it with on-the-fly creating python code, executing it, and using it to generate the response. Gemini's contains an interesting trick Google plays to improve benchmark results. These (inspired) engineering tricks lead to an interesting conclusion about the state of LLMs.

Memorisation: the deep problem of Midjourney, ChatGPT, and friends

Posted by gctwnlon December 26, 2023August 7, 2024in AI, Background Knowledge7 Comments

If we ask GPT to get us "that poem that compares the loved one to a summer's day" we want it to produce the actual Shakespeare Sonnet 18, not some confabulation. And it does. It has memorised this part of the training data. This is both sought-after and problematic and provides a fundamental limit for the reliability of these models.

What makes Ilya Sutskever believe that superhuman AI is a natural extension of Large Language Models?

Posted by gctwnlon December 15, 2023August 7, 2024in AI, Background Knowledge, Uncategorized3 Comments

I came across a 2 minute video where Ilya Sutskever — OpenAI's chief scientist — explains why he thinks current 'token-prediction' large language models will be able to become superhuman intelligences. How? Just ask them to act like one.

The Truth about ChatGPT and Friends — understand what it really does and what that means

Posted by gctwnlon October 23, 2023August 7, 2024in AI, Background Knowledge7 Comments

On 10 October I gave an (enthusiastically received) explainer talk at the EABPM Conference Europe 2023, making clear what ChatGPT and friends actually do — addressing the technology in a non-technical but correct way — and what that means. That presentation fills the gap between the tech and the results. At the end you will understand what these models really do in a practical sense (so not the technical how) when they handle language, see not only how impressive they are, but also how the errors come to be (with a practical example), and what that means what we may expect from this technology in the future.

Share this:

Share this:

Share this:

Share this: