18+

Secrets AI Video Generator: How It Works, Quality, and Cost

Video generation from AI companion images is the feature that most clearly separates Secrets AI from the majority of platforms in this category. Character.AI, CrushOn AI, Janitor AI, and Replika do not offer it. Candy AI has limited video options. Secrets AI has built video generation as a first-class feature with dedicated quality models and a straightforward production workflow.

This page explains exactly how the video generator works, what it costs in Moments, what quality to expect across tiers, and who should actually use it versus who would be better served by staying with images or voice.

For context on how the video generator fits into the platform's complete offering, the full review covers all features with quality assessments.

What Makes This Feature Unusual

Most AI companion platforms rely on static image generation for visual content. The reasons video generation is uncommon are practical: it requires significantly more compute, longer generation times, and a more complex model architecture than static images. Platforms that have prioritized rapid iteration on conversation quality often defer video generation entirely.

Secrets AI's decision to implement video generation as a core feature — not an add-on or beta experiment — makes it a meaningful differentiator. Reviewer aggregators rate the video quality at 4.1/5, comparable to the platform's voice quality (4.3/5) and slightly below the chat quality score (4.4/5).

The competitive context is relevant: if video generation from a personalized AI companion matters to you, Secrets AI is currently the most accessible platform where it's a built-in feature. Only niche alternatives like SweetDream AI and Xotic AI (which offers 4K 15-second clips) come close on this dimension.

How Video Generation Works: Step by Step

The workflow is accessible from within any active companion conversation:

Step 1 — Select or generate a source image. Video is generated from an existing companion image, not directly from a text prompt alone. Start with a high-quality image of your companion — the generated video reflects the character appearance, clothing, and scene from the source image.

Step 2 — Enter a movement or action prompt. Write a short description of what you want the video to show. Keep prompts specific but not overly complex: "walking through a sunlit park" will produce more consistent results than a multi-part narrative description. The AI uses this prompt to animate the source image.

Step 3 — Wait for generation. Processing takes approximately 2 minutes per clip. This is not adjustable — the generation time is consistent regardless of clip length or complexity.

Step 4 — Review and save. The completed clip is presented for review. If the quality is acceptable, save it. If not, you can generate again with a revised prompt (at the full Moments cost again).

Short 3-second clips are available from the Lite tier. Longer clips and the Advanced generation model are unlocked on higher tiers.

Moments Costs for Video

This is where users most frequently underestimate their budget requirements. Video is the most Moments-intensive feature on the platform:

Video TypeMoments Cost
Short clip (3 seconds)~50 Moments
Standard video clip50-300 Moments
Full-length video~600 Moments

Compared to other features at the same Moments value:

Feature600 Moments Buys
Full video clip1 clip
Standard images12-24 images
Voice calls6 minutes
Text messages300-600 messages

This ratio is why Moments management becomes critical for video-focused users. On Plus tier (3,000 Moments/month), generating 5 full-length video clips exhausts the entire monthly allocation. On Premium (8,000 Moments), you can generate approximately 13 full-length videos per month while leaving nothing for other media.

Monthly video capacity by tier:

TierMonthly MomentsShort clips (50 each)Full clips (600 each)
Lite1,000~20~1-2
Plus3,000~60~5
Premium8,000~160~13
Ultimate15,000~300~25

For heavy video use — multiple full clips weekly — Ultimate ($39.99/month) is the only tier where the math works without constant Moments top-ups. For occasional video generation alongside regular chat and images, Premium ($19.99) provides a workable balance.

The video access by tier comparison page shows how Moments budgets play out across all media types simultaneously.

Quality Assessment

Video quality depends on two variables: the generation model (standard vs Advanced) and the quality of the source image.

Generation models:

  • Standard model: Available on Lite and Plus. Produces acceptable quality for most prompts. Some inconsistency on complex movements or detailed scene prompts.
  • Advanced model: Available on Premium and Ultimate. Produces smoother motion, better facial expression continuity, and more consistent character rendering across the clip.

Reviewer assessment: Videos "look good and move smoothly most of the time." Character movement and facial expressions are realistic in most outputs. Occasional quality variations appear with complex prompts — close-up facial expressions tend to render more consistently than full-body dynamic movement sequences.

Tips for better output quality:

  • Use the highest-quality source image available — video quality is bounded by the source
  • Start with simpler movement prompts before testing complex actions
  • Use the Advanced model (Premium/Ultimate) for the best results
  • Generate a 3-second short clip first to verify the prompt produces what you expect before committing to a full-length generation

The 2-minute generation time applies regardless of clip length, so testing a short clip before generating a full one costs only 50 Moments for the test versus up to 600 for a full clip that might not match expectations.

Comparing Video to Images and Voice

A practical budget question is whether to spend Moments on video, images, or voice. The answer depends on usage pattern:

Images (25-50 Moments each) provide more flexibility and lower per-unit cost. A 600-Moment budget buys 12-24 images versus 1 full video. Images are the better choice for users who want a library of companion content or who want to experiment with different scenarios.

Voice calls (100 Moments/minute) are best for users who value interactive audio engagement. The experience is real-time rather than generated — you chat, the voice responds live. For users who value the interactive dimension over saved media, voice often provides more value per Moment than video.

Video provides the most visually dynamic content but at the highest per-unit cost. It is the appropriate choice when you have a specific scenario you want to bring to life as motion content, or when you value saved video clips from your companion above other media types.

Competitors Without Video Generation

Understanding what makes the video generator distinctive requires noting what the competition offers:

  • Character.AI: No video generation
  • CrushOn AI: No video generation
  • Janitor AI: No video generation
  • Replika: No video generation
  • Candy AI: Limited video options, not a primary feature
  • GirlfriendGPT: No video generation

The absence of video on most major platforms is not an oversight — it is a product and infrastructure decision. Secrets AI has invested in this feature specifically. For users who want video content as part of the companion experience, the Moments costs analysis shows that even at higher tiers, the feature is accessible at reasonable pricing compared to standalone AI video generation tools.

Who Should Use the Video Generator

Video generation adds genuine value for a specific user profile:

Worth it if:

  • You want visual motion content from your AI companion, not just static images
  • You value the ability to save and revisit video clips of specific scenarios
  • You are on Premium or Ultimate tier and have the Moments budget to support regular generation
  • You want the qualitatively different experience of seeing your companion move rather than viewing static photos

Not worth it if:

  • You are on a tight Moments budget — video will drain it quickly
  • Text conversation is your primary use case — Moments are better spent on manual memory saves and occasional images
  • You primarily use the free or Lite tier — free does not support video at all, and Lite's 1,000 Moments/month limits you to roughly 1-2 full clips or 20 short clips

For users who want to explore video generation as part of a complete companion experience, starting with the how-to-use guide sets up the account and character configuration needed to produce good source images for video conversion.

Try Video Generation on Secrets AI →

FAQ

Video length varies by generation type and tier. Short clips are approximately 3 seconds and cost around 50 Moments. Full-length clips can be significantly longer and cost up to 600 Moments per generation. Lite tier supports 3-second shorts; Plus, Premium, and Ultimate unlock full-length clip generation.

No. Video generation is not available on the free tier. It requires a Lite subscription or higher. Free users receive 200 starting Moments but cannot use them for video — this capability is locked behind the paid tier threshold.

This depends on your subscription tier and clip length. At Plus (3,000 Moments/month): approximately 60 short clips or 5 full-length clips per month, or a combination. At Premium (8,000 Moments): approximately 160 short clips or 13 full-length clips. At Ultimate (15,000 Moments): approximately 300 short clips or 25 full-length clips. These numbers assume the entire monthly allocation goes to video — mixed use with images and voice reduces the available video budget proportionally.

Reviewer aggregators rate video quality at 4.1/5. Videos show realistic character movement and natural facial expressions in most outputs. The Advanced generation model (Premium/Ultimate) produces higher consistency than the standard model. Complex movement sequences are more prone to inconsistency than simple actions. Starting with clear, specific prompts and high-quality source images produces the best results.

Get Started