Weeknotes 276 - Is the end of prompt engineering near?

This week, the latest updates on AI tools and, specifically, the new approaches to unlocking generative AI. And much more, from robotics to proxy citizenship.

Proxy citizens, robotic-like ‘digital twins’ creating new social city fabric in neighbourhoods (Midjourney)

Hi, y’all!

Welcome to the newsletter, especially the new subscribers. There is a lot of interesting news this week, going beyond the current platforms: new AI capabilities, such as making movies, have also reached mainstream media. At the same time, Google is upgrading Gemini with some interesting features that might change prompt engineering, which definitely triggered a thought!

Before I continue, let me know if you want to dive deeper into these topics or create specific (near) future studies. Check out the Target_is_New website to find out what is possible.

Triggered thoughts

Apparently, oio.studio worked on Google’s new Gemini interface. That made me curious, as I know them for their interesting thought experiments, as well as real experiments with more-than-human futures, partnerships with autonomous creatures, and creative AI overall.

The Gemini explanation shows how they created a reasoning interface that produces a debrief of the intention behind the human input. It makes me think about how our interactions with machines are becoming more human, as we no longer need to adapt to machine-like thinking to get the best results. We got used to adapting our thinking a bit to the machine’s way of thinking, as with search engines. The first step was the translation from natural language to machine-understandable queries in the chat interfaces. Gemini seems to go further by adding an extra layer of ‘understanding’. So it is not only the interface that changes, as with chat interfaces; the internal reasoning is also designed differently. In the demo, you see how, under the hood, Gemini tries to make sense of what type of question it really is and adapts the interface to that type.

Would this mean that the profession of prompt engineering is ending soon? For now, the example shown focuses mainly on presenting the search results. It is unclear whether it adjusts the findings, too, and whether there is real understanding (see below). As a new approach to shaping the interface, it follows what Perplexity.ai is also trying (check out the interview with the founder on last week’s Hard Fork): finding a new form for building relations with our AI partners.

Events to track

Notions from the news

Why? Cringe. Also, it is interesting how he is trying to enter AR, where VR has always been the main use. While Apple is keeping its distance from reality, hiding branding…

Why Apple won’t call the Vision Pro “virtual reality”
Apple has forbidden developers from using “VR,” “AR,” or “MR” to describe their Vision Pro apps. That’s a mistake.

This week on AI

OpenAI to start. Most of the attention went to the newly introduced text-to-video model Sora. You need enough computing power to create distributed movies, but it is definitely impressive. And it is now part of mainstream media, with attention on the potential ‘unintended’ consequences of fakes. As always, it will probably not be the fake world leaders that cause the biggest impact, as there will be guardrails for that. There are many more indirect effects. Bullying will get another dimension, fakes of trusted officials can push you into making certain decisions, etc. There is a promising future for fake-scanning software (like virus scanners). Unhide the hidden dangers…

Sora Can’t Handle the Truth
Monkey see, monkey confabulate
The Era of Abstraction & New Creative Tensions
Let’s talk about creative tensions and the implications of interfaces that abstract away the sources of news, commerce, and information.

Sora is stimulating interest in Worldcoin, that other Sam Altman project.

And Google is also updating Gemini. Battle of the Giants.

ChatGPT vs. Gemini: Which AI Chatbot Subscription Is Right for You?
Everyone wants your $20 per month for access to their best AI chatbot. Who gets it depends on what features are important to you.

Advanced Machine Intelligence (AMI, pun intended?) for self-supervised learning.

V-JEPA: The next step toward advanced machine intelligence
We’re releasing the Video Joint Embedding Predictive Architecture (V-JEPA) model, a crucial step in advancing machine intelligence with a more grounded understanding of the world.

The now mundane chat-based interfaces are getting more and more enhanced and trustworthy. Long memory makes a difference in building a relationship with your AI buddy. Or AI Valentine.

OpenAI experiments with giving ChatGPT a long-term conversation memory
AI chatbot “memory” will recall facts from previous conversations when enabled.
The world’s first real AI-powered Valentine’s Day | Semafor
AI has now become an integral part of the online world of romance, giving experts a better understanding of how the technology could change dating apps and help bridge language barriers.
AI-powered romantic chatbots are a privacy nightmare
They collect massive amounts of data with little disclosure about its use.

At the same time, we are still at the beginning, and even easy tasks like creating interesting travel plans deliver boring outcomes. In my experience, it takes some (chat) work to get inspirational suggestions. It is just like the real world; the generic advice is the default, and real ‘geheimtips’ need extra work.

The AI Industry Is Stuck on One Very Specific Way to Use a Chatbot
OpenAI, Google, and Microsoft are dying to help plan your next trip.

Statistics versus understanding. There is a consensus that current generative AI does not really understand what it is generating or even what it is using as a reference to generate. On the other hand, what is understanding? Is it not understanding because of a different model of thinking, or because of a lack of experience? What role does real, embodied experience play in understanding? And how do we relate to hallucinating systems, where we have summaries without the original?

Statistics versus Understanding: The Essence of What Ails Generative AI
The Foundation Remains Shaky
Summaries without originals
A recurring concern with LLMs is that tech companies will run out of data with which to train them and they will gradually become less and less useful. How would tech companies, which have perfected so many means of surveillance and data extraction, run out of data, especially when institutions

Philosophy might help out when our society is getting weird.

Can philosophy help us get a grip on the consequences of AI? | Aeon Essays
Generative agents will change our society in weird, wonderful and worrying ways. Can philosophy help us get a grip on them?

I remember that there have been computational music-composing buddies before, but now we have AI to add to the mix…

spin AI-music synthesizer encourages users to explore nuances of algorithmic music
spin by designer arvind sanjeev is an ai music synthesizer that allows you to co-create compositions with a language model, musicgen.

In case you want to keep track of the developments with the EU AI Act, there is a dedicated newsletter.

And some potential positive (or optimistic) news on the AI footprint.

AI has a large and growing carbon footprint, but there are potential solutions on the horizon
Technological approaches could help reduce the carbon impact of artificial intelligence systems.

Some existing tools and services are now AI-enhanced, like Slack and Reddit, and some existing AI tooling has been updated, like Stability AI. For the Dutchies, Erwin has been the master of finding tools for cyber punkers (making creative output with off-the-shelf tools, aka low code) for decades, and now he is keeping track of all interesting AI-based tools.

Speak the language of prompts!
Friends, in between writing newsletters, I am also busy with AI in other ways. I recently gave a presentation on the possibilities of AI in the events industry, and on Monday I started teaching a 5-day AI course for the Vereniging van Nederlandse Poppodia en Festivals to 15 participants who are connected to venues, festivals and concert organisations. Fun!

Where Erwin is often creating very functional applications for self-organising and publishing, Matt Webb is almost creating an art installation with the same punk attitude.

New app! A compass that points to the centre of the galaxy
Posted on Thursday 15 Feb 2024. 1,568 words, 11 links. By Matt Webb.

Robot and autonomous round-up

When I read about autonomous space robots, I wonder if they are our delegated workers in space, or more like a new breed of species.

NASA tests autonomous space robots for off-world construction
NASA is developing autonomous space robots to build shelters, solar arrays, and more on the moon and Mars.

What do you think about this application of the robot dog as guidance for the visually impaired? Does it make sense to use the dog as an archetype here for better acceptance both for the visually impaired and the outside world?

RoboGuide robot dog uses AI to assist the visually impaired - The Robot Report
Technologies such as RoboGuide could enable people with visual impairments to more fully navigate and interact with the world.

We need a new narrative for autonomous vehicles.

Autonomous Vehicles Have a Problem of Narrative
The tech industry needs to be honest about the trade-offs of risk

Platforms and beyond

I hope I find time to watch Cory Doctorow’s talk at Transmediale.

“So what's enshittification and why did it catch fire? It's my theory explaining how the internet was colonized by platforms, and why all those platforms are degrading so quickly and thoroughly, and why it matters – and what we can do about it.”

Pluralistic: My McLuhan lecture on enshittification (30 Jan 2024) – Pluralistic: Daily links from Cory Doctorow

Apple has a better image than other big platforms, but there is a danger they will do serious harm to the open web principles.

Apple appears to be breaking iPhone web apps in the EU
It might be a feature, not a bug.

I agree with Peter that the good old social internet (aka web 2.0) is in a transition, whether it is pure fragmentation and reformation or a shift to something new (for years, we have predicted private social networks to become dominant). For a student design minor on interactive technology at Delft University of Technology, themed future citizenship, Cities of Things is commissioning an assignment to think about proxy citizens, robotic-like ‘digital twins’ creating new social city fabric in neighbourhoods. Curious what the results will be…

Refractive Fragmentation — where did social media go?

Just nice

If you happen to be near Tokyo…

‘technology is not in conflict with nature’ - teamLab’s new borderless museum opens in tokyo
art collective teamLab presents its new borderless digital art museum, now relocated to azabudai hills in tokyo.

Paper for the week

Large Language Models: a Survey

The research area of LLMs, while very recent, is evolving rapidly in many different ways. In this paper, we review some of the most prominent LLMs, including three popular LLM families (GPT, LLaMA, PaLM), and discuss their characteristics, contributions and limitations.

https://doi.org/10.48550/arXiv.2402.06196

See you next week!