A smartphone screen displays an Instagram Reel titled "DTC Babies Podcast". The video shows two AI-generated babies wearing headphones and speaking into microphones, alternating shots. On-screen captions display their dialogue.
Baby 1: Do you remember a few years ago when we didn't have all these AI creative strategists?
Baby 2: I, I wasn't even born yet.
Baby 1: Neither were you.
Travis: And it got over 12,000 in spend on Meta with a 5.41% click-through rate. It's been our top performing ad for a couple weeks. Let me show you exactly how we made it, and then I'll break down when you should actually use AI ads like this one.
A dashboard showing ad performance metrics. The "Traffic Video - Baby podcast..." ad has a spend of CA$12,869.13 and a CTR (outbound) of 5.41%.
Logos for AI tools appear on screen: ElevenLabs, ChatGPT, hedra, VEED.
Title card: "Step 1: Upload a photo of yourself into ChatGPT"
Travis: Step one is to upload a photo of yourself into ChatGPT 4o or 5, the image gen, to turn yourself into a baby. You're just gonna type in the prompt, "Make this person into a baby sitting wearing headphones in a podcast studio." And boom, AI baby version of you is ready to go.
Screen recording of ChatGPT interface. A photo of a man is uploaded with the prompt: "Make this person into a baby sitting wearing headphones in a podcast studio." ChatGPT generates an image of a baby in a suit with headphones.
Travis: Now, make a second baby by typing in, "Make this other person into a baby in the exact same podcast studio, but from a different angle." Then you're gonna download both images, and it's time to move on to your script.
Screen recording of ChatGPT interface. Another photo of a different man is uploaded with the prompt: "Make this other person into a baby in the same exact podcast studio, but from a different angle." ChatGPT generates an image of a smiling baby with glasses and headphones.
Title card: "Step 2: Write out the dialogue we want the babies to say"
Travis: Step two is really gonna be to write out the dialogue that you want your babies to say in this ad. Here's a part of our actual script that we came up with. You can see that our approach was to use silly Gen Z slang like "bussin'" or "that slaps" or "this straight fire."
A document showing a script with timestamps and dialogue. Lines include "AI career strategies.", "Straight fire.", "A hundred percent.", "What's your favorite task to run in motion?", "I like generate winning UGC scripts.", "Oh, that slaps.", "That slaps.", "Super helpful when I just need to get an ad out quick!", "What about you?", "I love trim the fat.", "Y*** yeah, baby.", "The fat is bussin' for real."
Travis: Our approach was we want to make this even funnier than just two babies doing a podcast. Go over the top and make it even sillier and more ridiculous, and you'll probably end up more successful. We kept each line short, five to ten words max, and that was gonna help to make the animation smoother and easier to generate.
Two Google Docs side-by-side, titled "Baby script 1" and "Baby script 2", showing the separated dialogue for each baby.
Travis: We learned the hard way that if you try to input a super long script, it's gonna get all messed up. So just write it out, and then make sure you separate the lines that you want baby one to say from the lines from baby two to say into two separate documents, and you can upload them and create them separately.
Title card: "Step 3: Create the audio"
Travis: Step three is to create the audio from the script that you just wrote. So while we could have recorded the audio ourselves, we decided to use ElevenLabs' text-to-speech tool. It's super easy to use, and we just typed in the lines for baby one, generate, and give that a listen to make sure it sounds good.
Screen recording of ElevenLabs interface. Text is pasted into the text-to-speech box. The voice "Mark - Natural Conversations" is selected.
Travis: And then do the same for baby number two. Export each baby's audio as separate files that we're gonna throw into our video editor a little bit later.
Downloading the generated audio files from ElevenLabs.
Travis: Also, you want to get audio of baby one reading some voiceover that I'll be playing with some B-roll footage that you're gonna do and cut to after the podcasting part of your ad is over. It's great to have the same voice for consistency of one of your, your babies that's talking. That's what we did, and we think it made it more successful.
Text on screen: "Voiceover for Baby 1 Reading B-roll". Below it is a paragraph of text starting with "Browse the AI career of strategist marketplace in Motion..."
Title card: "Step 4: Get your talking babies made inside of this tool"
Travis: Step four is to get your talking babies made inside of this tool. We used Hedra, or Heedra. There are a few tools that do this, but we went with Heedra because it's ten dollars a month and it includes a realistic body movement, which is what we were looking for.
Screen recording of the Hedra website homepage and pricing page, highlighting the $10/month Creator plan.
Travis: So we're gonna upload the first baby image, add the audio file that we got from ElevenLabs for baby one, and hit generate. This part takes 30 to 60 seconds, but you're gonna look at that. Mouth movements, head tilts, subtle body language. For social media ads, this is totally gonna work.
Screen recording of Hedra interface. The baby image and audio file are uploaded. The generated video shows the baby's mouth moving and head tilting in sync with the audio.
Travis: Now do the exact same thing that you did with baby one for baby two. Download both videos in the highest quality that you can.
Screen recording of Hedra interface showing the second baby image and audio being processed into a video.
Title card: "Step 5: Import the footage into the editor and make cuts"
Travis: And then we're onto our last step. You're going to chop up these Heedra files that you downloaded into a video editor. You can use CapCut, we used VEED for this one, and we'll cut between them back and forth like a real podcast conversation.
Screen recording of VEED video editor interface. The two generated baby videos are imported.
Travis: Where you have baby one speaking, then cut to baby two, and then back and forth a couple times. Now you're just gonna add those essential elements like captions, a title card. We added a little disclaimer. We put in some B-roll and screen recordings of our actual product, and this grounds it and shows that we're actually selling something at the end here. Then you export it.
Screen recording of VEED interface showing the editing process. Captions are added. A title card "DTC Babies Podcast" is added. A disclaimer text is added. B-roll footage of a software interface is inserted.
Travis: Total creation time took about 45 minutes once you understand the process. Now let's talk about when to use ads like this. The performance marketing expert Alex Cooper, a good friend of Motion, said it best in my opinion. He wrote, "People are way more forgiving of AI when it's fun." It's just helping our audience have a little bit of fun.
A LinkedIn post by Alex Cooper, CEO of Adcrate. The text "People are way more forgiving of AI when it's fun." is highlighted.
Travis: And that's it. That's our whole strategy with this. Because right now, if you try to pass AI video ads as real, you're just going to erode trust.
Two images side-by-side. Left: A split image of Will Smith eating spaghetti, labeled "AI" and "Reality". Right: A split image of an eye, labeled "REAL" and "AI". The word "TRUST" appears in large red letters over Travis.
Travis: So our approach is to make it ridiculous, make it entertaining. Talking babies, talking animals. I also like what Alex said where he tries to come up with visceral ideas that make the invisible visible with AI, which is perfect for testing in your ads.
The same LinkedIn post by Alex Cooper. The text "I try to come up with visceral ideas that make the invisible visible." is highlighted.
Travis: So he notes most problems are invisible: stress, confusion, data overload. AI lets you visualize them in exaggerated ways that can become really effective in your ads.
Text on screen: "Most problems are invisible", followed by "Stress", "Confusion", "Data overload".
Travis: That's our tip. If you want to check out Motion, click the link in the description so you can analyze your crazy AI ads like we do and like the best of the best do.
Text on screen: "Click the link in the description"