Listen to this lesson
Estimated reading time: 9 minutes
The general-purpose AI assistants we've covered — ChatGPT, Claude, Gemini, Copilot — can do a lot. But the AI tool landscape is much broader than those four. There's a growing ecosystem of specialist tools that do specific things very well.
This lesson covers the major categories of specialist AI tools: what they do, when to use them, and which ones are worth knowing about.
AI image generators create pictures from text descriptions. You type what you want to see, and the tool produces an image. The quality has improved dramatically since the early days — modern tools can create photorealistic images, illustrations, logos, and artwork.
Midjourney — Widely considered the leader in image quality, particularly for artistic and photorealistic images. It runs through Discord (which takes some getting used to) and through a web interface. Paid only — plans start around $10 USD/month.
DALL-E 3 — OpenAI's image generator, built directly into ChatGPT. The advantage is convenience: you can generate images in the same conversation where you're working on text. Quality is good but generally a step behind Midjourney for artistic work.
Stable Diffusion — An open-source image generation model that you can run on your own computer (if you have the right hardware) or access through various online tools. Free to use locally, but requires technical knowledge to set up. Many online tools are built on top of Stable Diffusion.
Adobe Firefly — Adobe's AI image generator, integrated into Photoshop and other Adobe Creative Cloud tools. Its key advantage: it's trained only on licensed images, which makes it safer for commercial use. It's included with Creative Cloud subscriptions.
Leonardo.ai — A popular tool for generating game assets, illustrations, and concept art. It has a free tier and paid plans, and offers good control over the style and quality of outputs.
AI code assistants help people write, debug, and understand computer code. Even if you're not a developer, it's worth knowing these exist — they're changing how software gets built.
GitHub Copilot — The dominant AI coding assistant. It works inside your code editor (like VS Code) and suggests code as you type, similar to autocomplete but much more powerful. It can write entire functions, explain existing code, and help debug errors. Costs $10 USD/month for individuals; free for students and open-source contributors.
Cursor — A code editor with AI built in from the ground up. Rather than being a plugin for an existing editor, Cursor was designed around AI-assisted coding. It's popular with developers who want AI deeply integrated into their workflow.
Claude and ChatGPT — Both general-purpose assistants are surprisingly capable at coding tasks. You can paste code into a conversation and ask for explanations, debugging help, or modifications. This is often the simplest approach for people who aren't professional developers.
AI transcription tools convert speech to text — meetings, interviews, lectures, podcasts. They've become remarkably accurate.
Otter.ai — A popular transcription tool that integrates with Zoom, Google Meet, and Microsoft Teams. It can join your meetings automatically, transcribe in real time, and generate summaries. Free tier includes limited transcription minutes; paid plans start around $17 USD/month.
Whisper (by OpenAI) — An open-source speech recognition model that's exceptionally accurate across many languages and accents. You can use it through various apps and services built on top of it, or run it locally if you're technically inclined.
Microsoft Copilot in Teams — If your organisation uses Microsoft Teams with Copilot, meeting transcription and summarisation is built right in.
Google Meet transcription — Google's built-in meeting transcription, available on certain Google Workspace plans. Works well for English; other language support is growing.
Descript — A tool that combines transcription with audio and video editing. You can edit audio by editing the transcript text — delete a sentence from the transcript and it's removed from the audio. Particularly useful for podcast and video creators.
Traditional search engines give you links. AI-powered search tools give you answers, synthesised from multiple sources.
Perplexity AI — An AI search engine that answers questions with cited sources. It's like having a research assistant that reads multiple articles and gives you a summary with references. Free tier is generous; Pro costs $20 USD/month for more powerful model access.
Google AI Overviews — Google now shows AI-generated summaries at the top of many search results. You don't need to do anything special — they appear automatically. Quality varies; always check the linked sources.
ChatGPT with web browsing — When you ask ChatGPT a question that requires current information, it can search the web and synthesise results. Useful for quick research.
Microsoft Copilot search — Copilot's standalone chat includes web search, providing sourced answers to questions.
Explore one specialist tool.
1. Which AI image generation tool is integrated directly into Adobe's Creative Cloud?
Answer: c) Adobe Firefly — it's built into Photoshop and other Adobe tools, and trained on licensed images.
2. What does Perplexity AI do differently from a traditional search engine?
Answer: b) It synthesises information from multiple sources and provides answers with citations, rather than just giving you a list of links.
3. What is a key limitation to be aware of when using AI transcription tools?
Answer: c) Accuracy drops with strong accents, technical jargon, or poor audio quality — always review transcripts for important content.

Visual overview