We're a travel itinerary service that uses AI to build itineraries. So it's fitting — and a little absurd — that we've spent the last few weeks building a fully automated pipeline to generate AI travel Reels and publish them to Instagram, YouTube Shorts, and X. No human hands on video production. No stock footage. No hired creators. Just code, APIs, and cron jobs running while we sleep.

This is an honest accounting of what 48 hours of data looks like across 8 different Reel formats. Some formats are working remarkably well. Others have been an embarrassing waste of compute. We're sharing the real numbers — not because everything looks great, but because this kind of data is genuinely hard to find and we think it's worth putting into the world.

The meta irony isn't lost on us: an AI travel company publishing AI analysis of AI-generated content. We'll try to be useful anyway.

The Experiment

tabiji.ai is a free AI travel itinerary service. We build personalized trip plans from Reddit recommendations and real traveler data. That's the core product. But we wanted to grow the brand on social media without burning budget on traditional content creation, so we built a fully automated short-form video pipeline.

The setup: cron jobs run throughout the day, pick a destination and format from a queue, generate a video end-to-end, and publish it to Instagram. The same video then gets cross-posted to YouTube Shorts and X. No humans are involved in the video production itself — only in building and maintaining the pipeline.

Over 48 hours, we published across 8+ distinct Reel formats and accumulated enough data to see clear signal. Here's what we tested:

  • Tourist Mistake — "#1 Mistake Tourists Make in [City]"
  • Scam Alert — "Every Tourist Falls for This Scam in [City]"
  • Vintage POV — AI-generated footage styled as archival travel film from the 1960s–70s
  • One Thing You Can't Miss — "[City]'s Most Underrated Attraction"
  • This vs That — "Tourists Go HERE, Locals Go HERE"
  • Budget Reel — "What $50/Day Gets You in [City]"
  • POV — Generic first-person walkthrough footage
  • Countdown / Top 5 — Top 5 things to do in a destination

The Tech Stack

For the curious: every Reel is generated entirely by machines. The pipeline looks like this:

  • Video generation: Veo 3 (Google DeepMind) — generates 8-second cinematic clips from text prompts at 9:16 portrait ratio
  • Image generation: Nano Banana 2 (Gemini-based) — for formats that use still images rather than video
  • Music: MiniMax — AI-generated background music matched to the destination and mood
  • Text overlays: FFmpeg — applies title, subtitle, and branding text on top of the video
  • Publishing: Instagram Graph API → YouTube Data API → X API v2, all automated
  • Orchestration: Shell scripts + cron jobs running on a Mac Mini

The cost per Reel is roughly $0.06–$0.12 in API calls, depending on the format. The real investment is time spent building the pipeline — not ongoing production costs.

What's Working

1. Tourist Mistake — The Clear Winner

✅ Strong Performer

Consistently breaks 500+ views on IG, 700+ on YouTube. The format that generates the most saves and shares by far.

The "#1 Mistake Tourists Make in [City]" format is performing significantly above everything else. Here are the raw numbers:

Reel IG Views IG Shares IG Saves YT Views
#1 Mistake — Koh Phi Phi 2,175 22 9 828
#1 Mistake — Hoi An 2,020 13 784
#1 Mistake — Bangkok 533
2,175
Peak IG Views
22
Shares (Koh Phi Phi)
1.9%
Engagement Rate
828
Peak YT Views

The Koh Phi Phi reel hit 1.9% engagement — saves plus shares divided by views. For an account with essentially no followers running fully AI-generated content, that's not a number we expected to see. The saves tell the story: people are bookmarking these videos to come back to before their trip. That's exactly the behavior you want from a travel content brand.

This format works for both platforms simultaneously, which is rare. Most formats that hit on Instagram are flat on YouTube, and vice versa. Tourist Mistake breaks that pattern.

2. Scam Alert — IG's Biggest Number, Strong on YouTube

✅ Strong Performer

Prague hit 3,188 views — the highest single Reel we've published. Marrakech followed at 2,139. Strong on YouTube as well, essentially interchangeable with Tourist Mistake.

Reel IG Views YT Views
Scam Alert — Prague 3,188 750
Scam Alert — Marrakech 2,139
Scam Alert — Cairo 180 846

The Prague Scam reel is our highest view count to date: 3,188 on Instagram. Cairo is newer, so the IG number is still climbing — but it already hit 846 views on YouTube, the highest YouTube number across all formats. The Scam format and the Tourist Mistake format are algorithmically interchangeable on YouTube; both get picked up and distributed at similar rates.

One interesting outlier: Cairo's IG views (180) are much lower than its YouTube views (846), which is the inverse of most patterns we see. We don't have a clean explanation for this yet — it might be timing, hashtag distribution, or the specific visual style of that clip.

3. Vintage POV — YouTube's Sleeper Hit

📊 Platform-Dependent

Surprisingly strong on YouTube with the best engagement ratio of any format. Essentially dead on Instagram. Worth running — but YouTube-only.

Reel IG Views YT Views YT Likes YT Comments
Vintage POV — Paris 1962 119 746 13 1
Vintage POV — Tokyo 134 168
Vintage POV — Kyoto 110 60

Paris 1962 is the most interesting data point in the entire dataset. On Instagram: 119 views, zero saves, zero shares. On YouTube: 746 views, 13 likes, 1 comment. That's the best engagement ratio of any video we've published — 1.7% like rate on YouTube, which is genuinely good for Shorts.

The Vintage POV format generates AI footage styled as archival 16mm travel film. That aesthetic clearly resonates with a YouTube audience that skews nostalgic and has longer watch time expectations. On Instagram, the same content is invisible — the algorithm and the audience both want something actionable, not atmospheric.

The lesson: don't cross-post this format. Run it YouTube-only and let the nostalgic algorithm do its thing.

What's Not Working

1. "One Thing You Can't Miss" — Weakest Format Overall

❌ Kill It

Every single Reel in this format is under 150 views. Zero saves across 6+ published Reels. Zero shares. We're pausing this format immediately.

Reel IG Views Saves Shares
One Thing — Hanoi #1 142 0 0
One Thing — Hanoi #2 133 0 0
One Thing — Hanoi #3 123 0 0
One Thing — Hanoi #4 122 0 0
One Thing — Hanoi #5 119 0 0
One Thing — Hanoi #6 115 0 0
One Thing — Hanoi #7 (newest) 9 0 0

This data tells two separate stories. The first is that "One Thing You Can't Miss" is a weak hook — it doesn't create urgency, doesn't trigger fear or curiosity, and doesn't give the viewer a specific reason to watch right now. The second is that publishing 6+ Reels about the same city (Hanoi) in 48 hours dilutes each one. The algorithm almost certainly suppresses repeat content from the same destination cluster. We compounded a bad format with a bad distribution strategy.

The declining view counts across the Hanoi series (142 → 133 → 123 → ...) look like diminishing returns from algorithmic suppression, not just recency. We're retiring this format and diversifying destinations more aggressively going forward.

2. "This vs That" — The Conceptually Sound Failure

❌ Pause

The concept is solid. The execution — AI-generated video — can't deliver what the format promises. People want to see the contrast, not read about it.

Reel IG Views Likes Saves
This vs That — Saigon #1 132 1 0
This vs That — Saigon #2 130 0 0
This vs That — Saigon #3 127 0 0

"Tourists go to Ben Thanh Market. Locals go to Binh Tay Market." This is genuinely good travel advice. The problem is that showing the difference is what makes the format work — a split-screen of the crowded tourist market versus the authentic local one, actual footage, real contrast. Our AI-generated video clips aren't differentiated enough to make this viscerally clear. The text says "contrast" but the visuals don't deliver it. The format fails not because the idea is bad but because it requires something AI video generation can't yet consistently produce: clearly distinguishable real-world visual contexts.

3. Budget Reels — Useful but Forgettable

📊 Middling

115–131 views consistently. One save across four Reels total. People watch but don't save or share because there's no emotional hook — just information.

Reel IG Views YT Views Saves
Budget — Vienna ($50/day) 117 2 0
Budget — Leipzig ($40/day) 117 0
Budget — Ghent ($45/day) 118 240 0
Budget — Marseille ($50/day) 131 1

Budget content performs better than the worst formats, but it doesn't convert. No one saves a Budget Reel because the information feels disposable — you watch, you absorb a rough number, you move on. Compare that to the Tourist Mistake format, where the save behavior is strong because people want to reference the warning before their trip. Budget information doesn't have that urgency.

The exception: Ghent Budget hit 240 views on YouTube Shorts, which is nearly our platform average for decent content. Budget may perform better on YouTube than Instagram — we'll watch this more carefully.

4. Generic POV — Don't Bother

❌ Dead Format

Cappadocia: 116 views, 0 likes, 0 saves, 0 shares. A format with no differentiation gets no algorithm love.

A first-person walkthrough of Cappadocia's hot air balloon landscape — without a hook, without information, without a specific angle — gets 116 views and nothing else. The AI-generated footage isn't bad; the format just has nothing to say. Vibe-only content might work for creators with massive followings, but it doesn't work for algorithmic distribution to cold audiences.

Platform-by-Platform Breakdown

The most surprising finding in this data isn't which formats work — it's how dramatically the same content performs differently across platforms.

Instagram: Fear and Utility Win

Instagram's algorithm rewards content that people react to immediately. The formats that drive saves and shares get distributed. Saves are the most powerful signal — they tell the algorithm that someone found this useful enough to bookmark, which signals genuine value rather than passive entertainment.

On Instagram, the top performers share two traits: negative framing (mistake, scam, warning) and practical utility (something you'll actually use before your trip). Atmospheric content — beautiful footage, vibes, nostalgia — doesn't move the needle.

Format IG Performance Saves/Shares
Scam Alert2,139–3,188 viewsHigh
Tourist Mistake533–2,175 viewsHigh
Budget117–131 viewsMinimal
Vintage POV110–134 viewsZero
One Thing9–142 viewsZero
This vs That127–132 viewsZero
Generic POV116 viewsZero

YouTube Shorts: The Algorithm Rewards Nostalgia and Information

YouTube's short-form algorithm behaves differently from Instagram's. Watch time matters more — if someone watches the full 8 seconds, that's a strong positive signal. The Vintage POV format benefits from this: it's atmospheric and visually compelling enough that people watch it through, even if they don't save or share it on Instagram.

On YouTube, Scam and Tourist Mistake formats perform similarly to Instagram (750–846 views each), but Vintage POV also shows up strongly (746 views, 13 likes for Paris 1962). The platform's nostalgic audience — people who grew up watching travel documentaries — responds to the 1960s film grain aesthetic in a way Instagram's scroll-hungry audience doesn't.

X/Twitter: Basically Zero

We should just say this clearly: X is not working. Not a little bit — completely not working.

Platform Typical Range Best Result Likes
X (@tabijiai) 0–12 views 12 views 0

Every post: 0–12 impressions, 0 likes, 0 engagement. This isn't a content quality problem — the same videos are hitting 2,000+ views on Instagram. This is a distribution problem. The @tabijiai account has minimal followers, and X's algorithm does not surface video content from new accounts the way Instagram and YouTube do. The free-tier text+link approach that works for link sharing on X doesn't translate to video discovery.

We haven't invested in X engagement strategy (replying, following, community building) for the @tabijiai account, and it shows. X requires relationship-building before algorithm amplification in a way Instagram and YouTube don't. For now, we're treating X as a low-priority cross-post and not optimizing for it.

The Psychology Behind the Hooks

The data isn't just descriptive — it explains something about how people engage with travel content on social media. The winning formats all leverage the same psychological triggers.

Negative Framing Outperforms Positive Framing

"The #1 Mistake Tourists Make in Koh Phi Phi" outperforms "Koh Phi Phi's Best Hidden Gem" by a factor of 10x+. This isn't counterintuitive if you think about why people follow travel accounts: they're planning trips and they don't want to embarrass themselves or lose money. A warning is more immediately useful than an attraction recommendation. You can find things to do anywhere; avoiding a costly mistake requires specific knowledge.

Loss aversion is real, and it applies to travel content. The emotional weight of "this mistake could ruin your trip" is stronger than "this place is really cool."

Saves Signal Trip Planning Intent

The Tourist Mistake format generates saves; the Vintage POV format generates neither saves nor shares. Saves on Instagram represent deferred action — "I'm not going to Koh Phi Phi tomorrow, but I'll need this when I do." That's exactly the audience tabiji is trying to reach: people who are in trip planning mode, not just casual scrollers. The save behavior is a proxy for purchase intent in our context.

Specificity Creates Credibility

"Tourists make mistakes in Thailand" is forgettable. "The #1 Mistake Tourists Make in Koh Phi Phi" feels authoritative. The specificity of the destination implies specific knowledge, which implies credibility. The viewer assumes we actually know something about Koh Phi Phi — even if the content was generated by AI from aggregated travel data. The naming precision does a lot of work.

Fear > Curiosity > Information > Inspiration

Ranking the psychological hooks by performance across our dataset:

  1. Fear (scam, mistake, warning) — highest engagement, most saves
  2. Curiosity (unknown, secret, hidden) — moderate engagement
  3. Information (budget, logistics, tips) — watched but not saved
  4. Inspiration (beautiful footage, vibes, nostalgia) — low engagement on IG, better on YouTube

This ranking doesn't mean inspiration has no value — it means fear-based hooks are more powerful for algorithmic cold distribution to audiences who don't know your account yet. Once you have an audience that trusts you, inspirational content probably performs differently. We're not at that stage.

Where We Go From Here

The data gives us a clear prioritization. Here's what we're actually doing based on these 48 hours:

Scale What's Working

  • Tourist Mistake and Scam formats — increasing cron frequency, diversifying destinations globally. These two formats alone are driving nearly all meaningful engagement.
  • Vintage POV — YouTube-only — stopping IG cross-post for this format. It's earning real YouTube engagement and shouldn't be penalized for Instagram's algorithm preferences.

Kill What Isn't

  • "One Thing You Can't Miss" — retired. The format is too generic and we compounded the problem by flooding a single destination (Hanoi) with 6+ Reels in 48 hours.
  • "This vs That" — paused until we can solve the visual contrast problem. The concept works; AI video generation can't execute it yet.
  • Generic POV — dead. No hook, no distribution.

Open Questions

  • Does Tourist Mistake performance hold across less-traveled destinations, or is it partly driven by high-interest cities (Koh Phi Phi, Hoi An)?
  • Would Budget Reels perform better with fear-adjacent framing ("You're Overpaying in Vienna") rather than neutral information framing?
  • Can Vintage POV be adapted with an information layer (vintage framing + actual travel tips) to make it work on both platforms?
  • Is 48 hours enough data to kill a format, or are some of these low performers just not getting initial distribution? We're watching 7-day numbers carefully.

What This Means for AI-Generated Content

We're running what is — to our knowledge — one of the first fully automated AI short-form travel content pipelines in production at this scale. Some observations beyond just our own data:

The Hook Is More Important Than the Production Quality

AI-generated video from Veo 3 is genuinely impressive — cinematic, fluid, believable. But the Hanoi "One Thing" series had perfectly fine video and got 9–142 views. The Prague Scam reel hit 3,188 with similar production. The video quality is table stakes; the hook determines distribution. This is probably true for human-created content too, but it's especially pronounced when you can run controlled experiments at scale.

AI Video Still Can't Do Visual Contrast Reliably

The "This vs That" failure is a real limitation: AI video generation can produce types of footage, but it can't reliably produce footage that's visually distinct enough to make a contrast obvious. "Tourist street market" and "local street market" look similar coming out of Veo 3 unless you engineer the prompt very carefully — and even then, viewers watching at 1x speed on their phone can't necessarily parse the difference. This is a genuine gap in current AI video capability.

Platform-Specific Optimization Matters More Than Cross-Platform Efficiency

We built this pipeline assuming cross-posting was pure upside — same video, three platforms, more reach. The data shows that's mostly true, but Vintage POV is the counterexample: cross-posting actually hurts the YouTube metrics by association with a format that doesn't work on Instagram. If Instagram viewers skip Vintage POV quickly, that early exit behavior might signal to YouTube that the content is weak, even if YouTube's own metrics look fine. We don't know if this signal pollution is real, but it's a reason to segment platform strategies more carefully.

Fear-Based Travel Content Is Underserved

Most travel brands are optimizing for aspirational content — beautiful places, dream trips, wanderlust. The strong performance of Scam and Tourist Mistake suggests there's a real audience for utility-first, fear-adjacent travel content that nobody is aggressively serving. The information is genuinely useful (these scams exist, these mistakes do cost people money), the hook is strong, and the save behavior suggests it drives actual trip planning.

The best performing AI content isn't the most beautiful or the most creative — it's the most useful, packaged with the right emotional trigger.

We'll keep running experiments and sharing what we find. This pipeline is cheap to operate, fast to iterate, and produces real signal. The 48-hour snapshot above represents something like 80+ Reels published across formats and destinations — the kind of A/B test scale that would have cost thousands of dollars and months of production time in the old world of travel content creation.

Whether that's exciting or unnerving probably depends on which side of the camera you're usually on.