Our Exclusive AI Dataset


Comprehensive Text-to-Video Dataset for AI Training and Multimedia Applications

This dataset, designed to fuel text-to-video AI systems, contains 63 entries selected from a larger collection of 1000 data points. It is organized into six video categories: explanatory, everyday scenes, storytelling, documentary, user-generated content (UGC), and animation. The dataset is divided into two subsets (70% for training and 30% for testing) and includes files detailing attributes such as scene description, actions, language, duration, and more. It is primarily intended for use in multimedia search, automatic video creation, and video analysis, with applications across fields like education, advertising, and sociological analysis.


1. Objective (Text-to-Video Dataset)

This dataset is designed to train text-to-video AI models. It originally contains 1,000 data entries, but this sample includes 63.


2. Data Structure

2.1. Video Categories

  • Explanatory/Educational Videos: Detailed descriptions of concepts or processes.
  • Everyday Scene Videos: Captures daily life moments with rich visual and interactive details.
  • Storytelling/Narrative Videos: Stories with dialogues or narrations tied to visible events.
  • Thematic/Documentary Videos: In-depth narration explaining visual elements.
  • User-Generated Content (UGC): Videos from platforms with simple captions or subtitles.
  • Animated/Synthetic Videos: Precisely described actions and scenes, often with scripts or subtitles.

2.2. Technical Composition

  • data.csv: Complete dataset.
  • readme.md: Dataset usage guide.
  • Markdown Documentation: An extended version of the readme file.
  • Train and Test Folders:
    • train.csv: 70% of the data for training.
    • test.csv: 30% of the data for testing.

3. Field Descriptions

Field Description
TitleTitle of the video
Text DescriptionNarrative describing the scene or event.
Described ActionsKey actions described.
Emotions or ToneAmbiance or emotions in the description.
LanguageLanguage of the description.
DurationTotal video duration.
Content CategoryVideo classification.
LocationScene setting or location.
Audio PresenceIndicates if the video has audio.
EntitiesVisible objects, animals, or people.
SourceVideo origin (e.g., recorded, generated).
Creation DateRecording or creation date.
Tags/KeywordsKeywords for easier search.
URLLink to access the video.
Weather ConditionsRelevant weather information.
Time of DayMorning, afternoon, evening, or night.
ChannelThe channel owner of the video.

4. Use Cases

  • Multimedia Search: Enhance text-based search and video indexing.
  • Automated Content Creation: Generate educational or UGC videos from text.
  • Video Analysis: Detect emotions, objects, and visual elements in scenes.
  • Personalized Content: Tailor videos for ads or virtual assistants.
  • Specialized Applications: Create educational content, VR/AR media, or conduct social/psychological studies.
Download Dataset
Technical Documentation for the Text-to-Video Dataset “VidData”
Chatbot
Chatbot images
Welcome to Databoost. We are at your disposal for any assistance. How can we help you?

Chatbot images

Info

Databoost, registered in the United States, is an international company with offices and subsidiaries in Madagascar. Through this global structure, we provide superior quality solutions by combining American expertise and local Malagasy talent. We emphasize flexibility, creativity, and efficiency, with a commitment to serving our clients on a global scale while remaining deeply rooted in local realities.

Subscribe to our newsletter