Soul

Enjoy animation results of Soul on Soul-Bench

Text prompt: The character in the image is singing, wearing an elegant diamond tiara, a white gown, and exquisite necklace and earrings, holding a microphone, with a focused and gentle expression, as if passionately performing on stage.

Text prompt: The man in the image is walking forward while speaking to the camera. The camera slowly pulls back, keeping him centered in the frame. He is wearing a white hard hat and carrying a black-and-yellow toolbox. Behind him is a room under renovation, with a ladder and a level placed nearby.

Text prompt: The character in the image is speaking directly to the camera, occasionally making gestures that match the content of their speech, with a fixed camera angle. This fluffy orange little monster is grinning widely, showing neat teeth, with round eyes blinking expressively. Its hands hang naturally at its sides, occasionally lifting excitedly to make a "V" sign or gently patting its little chest, appearing lively and adorable. The background is a warm orange-yellow, emphasizing the character's fluffy texture and vibrant colors.

Text prompt: The character in the image stands in a golden meadow filled with daisies, smiling and speaking directly to the camera, with hands naturally clasped in front, occasionally moving gently to match the speech content. The camera remains fixed, with a background of rolling green hills and a blue sky with white clouds, creating a serene and warm pastoral atmosphere.

Text prompt: The character in the image sings while playing the musical instrument in her hands, wearing elaborate ethnic attire, adorned with intricate silver jewelry and a colorful furry headdress, with two braids hanging over her shoulders. The background features vast grasslands under a bright blue sky with white clouds, with sunlight shining upon her, creating a dreamy and richly ethnic atmosphere.

Text prompt: The man in the image walks forward while speaking directly to the camera. The camera slowly pulls back, keeping him centered in the frame. He wears a black leather jacket and holds a black-and-white electric guitar, his expression focused and his steps steady. The background is an alleyway covered in vibrant graffiti, with warm yellow streetlights and neon signs flickering behind him, creating a gritty urban rock atmosphere. Occasionally, he gently strums the guitar strings with his left hand, while his right hand rests naturally on the body of the guitar, as if improvising a performance. The camera remains steady, emphasizing the interaction between the subject and his surroundings.

Text prompt: The character in the image is singing directly into a microphone, wearing professional headphones, with a focused expression. The background features recording studio equipment, creating an immersive music recording atmosphere.

Text prompt: The man in the image is standing in a bright office, speaking directly to the camera. He holds a pen in his right hand and wears a silver watch on his left wrist, occasionally using the pen to gesture in sync with his speech. The camera remains fixed. The background features floor-to-ceiling windows and a bookshelf, with city buildings visible outside. Inside, there are green plants and a desk, creating an overall professional and composed atmosphere.

Text prompt: The man in the frame is speaking directly to the camera, smiling while adjusting his bow tie with one hand. His right hand hangs naturally by his side, while his left hand holds a white suit jacket draped over his shoulder. A white boutonniere is pinned to the jacket. The camera remains stationary.

Text prompt: The character in the image is singing, holding a round fan adorned with floral branches, as if reciting poetry in a garden. The background features a classic ancient palace courtyard, with gentle sunlight cascading down and flowers swaying lightly in the breeze, creating a poetic and elegant traditional atmosphere.

Text prompt: The woman in the image is speaking directly to the camera, with a fixed shot. She is wearing elegant, long dangling earrings, has refined makeup, and a composed expression. The background is warm and soft, creating a sophisticated and serene atmosphere.

Text prompt: The woman in the image is speaking directly to the camera, occasionally using her right hand to gently brush her hair aside, with a fixed camera shot.

Text prompt: The woman in the image is speaking directly to the camera, occasionally raising her arms gracefully to gesture in sync with her speech. The camera remains stationary. She is wearing a black qipao and elegant earrings, with a blurred cityscape of nighttime lights in the background, creating a gentle and sophisticated atmosphere.

Text prompt: The woman in the image is singing, wearing light blue traditional-style clothing, adorned with an elegant diamond crown, holding a microphone, with a focused and gentle expression. The background features deep blue stage lighting, creating a dreamy atmosphere.

Text prompt: The little fox in the image is facing the camera and speaking, with big eyes full of curiosity, occasionally blinking, and the camera remains fixed.

Text prompt: The man in the image is speaking directly to the camera, with a fixed shot. The background features nighttime city lights, creating an urban atmosphere. He appears focused, wearing earrings and a dark shirt, exuding a mature and composed demeanor.

Text prompt: The man in the image is speaking directly to the camera, with his arms crossed over his chest, and the camera remains stationary.

Text prompt: The elderly man in the image is standing in front of a lush green plant background, holding a smartphone with both hands, speaking directly to the camera. The shot is static.

Text prompt: The man in the image is wearing a hiking backpack and holding trekking poles, standing in front of snowy mountains, speaking directly to the camera. Occasionally, he gently taps the ground with his trekking poles to match the content of his speech. The camera remains fixed.

Text prompt: The character in the image is seated gracefully by the window, wearing a light purple Hanfu, speaking softly with both hands resting naturally on their knees. The camera remains fixed. In the background, branches of blooming plum blossoms extend into the frame, while a celadon porcelain vase and a blue-and-white porcelain jar sit beside the window, complementing each other beautifully. Warm light filters through the window lattice, creating a serene and elegant classical atmosphere. Occasionally, the character gently lifts their gaze, as if making eye contact with the camera, or slightly nods their head, exuding a calm and composed demeanor.

Text prompt: The character in the scene happily holds a cute gift box in the snowy landscape, singing softly toward the camera as if gently humming a warm winter melody. Snowflakes gently fall in the background, with bare tree branches standing quietly, creating a cozy and healing atmosphere.

Text prompt: The man in the image is singing while playing the guitar in his hands.

Text prompt: The character in the image is singing, with a soft and blurred background. She softly hums a cheerful pop song, her eyes bright and full of emotion, as if sharing a beautiful moment with the audience. She wears pearl earrings and a pearl necklace, dressed elegantly and refined. The overall atmosphere is warm and dreamy, as if sitting by a sunlit window, conveying warmth and hope through her singing.

Text prompt: The man in the image is speaking directly to the camera, holding a yellow tape measure steadily with both hands, occasionally tapping or gesturing with the tape to emphasize his explanation. The camera remains fixed. The background features a busy construction site, with a high-rise building under construction, tower cranes, and a yellow excavator visible behind him. The scene is bathed in bright sunlight, creating a realistic and powerful atmosphere.

Text prompt: The character in the image is singing, her eyes softly shimmering and a faint smile playing on her lips, as if lost in the musical atmosphere. She is wearing a kimono adorned with delicate pink cherry blossom patterns, with a pink ribbon hair accessory in her hair. Sitting by the window under soft, warm light, she sways gently to the music, creating a serene and dreamy ambiance.

Text prompt: The woman in the image is facing the camera and speaking, with both hands naturally placed in her jeans pockets. Occasionally, she gently shakes her curly hair. The camera remains stationary.

Text prompt: The man in the image is wearing a blue three-piece suit, paired with a white shirt and a red patterned tie. He is smiling, with his right hand in his pocket and his left hand lightly resting on his suit jacket. As he walks forward while speaking to the camera, the camera slowly pulls back, keeping him centered in the frame.

Text prompt: The man in the image stands under an ancient stone colonnade, wearing an elaborate golden-embroidered ceremonial outfit with a deep red velvet shawl. His hands are clasped in front of him as he speaks directly to the camera. The camera remains stationary, while the warm golden sunlight and the weathered textures of the stone columns in the background create a rich, classical atmosphere.

Text prompt: The character in the image is wearing an exquisite traditional bridal gown, adorned with an elaborate floral crown, speaking directly to the camera with hands gently clasped in front of the chest, exuding a graceful and elegant demeanor. The camera remains stationary, focusing on the character's delicate facial features and intricate costume details, creating a dreamy and beautiful atmosphere.

Text prompt: The woman in the image is speaking directly to the camera, holding a bouquet of fresh flowers. Occasionally, she gently taps the bouquet with her fingers or slightly turns her head. The camera remains stationary.

Text prompt: The man in the image is speaking directly to the camera, with a fixed shot.

Text prompt: The character in the scene is singing, wearing a pink satin headband, with soft eyes gazing forward. The background features a warm, softly lit indoor environment, creating a serene and dreamy atmosphere.

Text prompt: The woman in the image stands in the center of an elegant hallway, speaking directly to the camera while slowly walking forward. The camera gradually pulls back, keeping her centered in the frame. She is wearing a sophisticated navy blue suit set, paired with a brown silk scarf and light-colored high heels, exuding calm confidence. The hallway features classical-style door frames and marble flooring on both sides, creating an atmosphere of refined elegance and grandeur.

Text prompt: Pikachu sits inside a transparent spherical capsule, which rests on a Poké Ball-shaped base. It faces the camera directly, speaking with its hands naturally placed in front of its body, occasionally moving them gently to match the rhythm of its speech. The background features a colorful Pokémon-themed amusement park, with a Ferris wheel and pink buildings visible. The camera remains fixed, keeping Pikachu centered in the frame, creating a warm and adorable atmosphere.

Text prompt: The man in the image is speaking directly to the camera, with a fixed shot.

Text prompt: The boy stands in the center of the art studio, holding a palette in one hand and a paintbrush in the other, speaking directly to the camera. Occasionally, he lightly taps the palette with the brush or raises his hand slightly to gesture in sync with his speech. The camera remains fixed. He wears a paint-splattered apron and a beret. The background features a weathered wall and two easels, while the floor is scattered with paint stains, creating a rich and immersive artistic atmosphere.

Text prompt: The woman in the image stands on a sandy beach by the sea, holding a surfboard, walking forward while speaking to the camera. The camera slowly pulls back as she moves, keeping her centered in the frame. She wears a straw hat, a vibrant floral-print dress, and a shell necklace, with a natural and relaxed expression. The background features blue ocean waves and a clear sky.

Text prompt: The woman in the image is sitting at a desk, speaking directly to the camera, with her hands resting naturally on the tabletop. Occasionally, she gently lifts her fingers to make elegant gestures that match her speech. The camera remains fixed. Behind her, a bookshelf and a desk lamp create a warm and intellectual atmosphere. She wears a pearl necklace and earrings, and her expression is calm and confident.

Text prompt: The character in the image is speaking directly to the camera with a fixed shot. The character is an adorable Shiba Inu with large, round eyes, featuring a tri-color coat of black, white, and brown. Its ears are perked up, and it is looking attentively at the camera, as if interacting with the audience. The background is simple and minimal, emphasizing the character's cute and expressive demeanor.

Text prompt: The character in the image is speaking directly to the camera, with a fixed shot.

Text prompt: The character in the image is speaking directly to the camera, with a fixed shot. She wears a blue-and-yellow headscarf, a pearl earring, and warm-toned clothing. The background features vibrant, colorful graffiti-like lines, creating a lively yet retro artistic atmosphere. Her eyes are gentle, and her lips are slightly upturned, as if softly narrating a story about art and youth.

Text prompt: The man in the image stands on the red carpet, wearing a striking pink cowboy outfit with a matching pink cowboy hat. His hands are naturally crossed in front of him as he speaks directly to the camera. The camera remains fixed. The background features the iconic black quilted wall of the Grammy Awards, creating an overall glamorous and vibrant atmosphere.

Text prompt: The man in the image is speaking directly to the camera, holding dumbbells firmly in both hands, occasionally performing small, controlled strength demonstration movements that match his speech content. The camera remains fixed.

Text prompt: The woman in the image is sitting at a wooden table in a café, holding a cup of coffee with both hands, speaking directly to the camera. Occasionally, she gently sways the coffee cup to match her speech. The camera remains stationary. She is wearing round-framed glasses, a gray fuzzy coat over a white turtleneck, with a natural and relaxed expression. The background features a warm-toned café setting with wooden tables and chairs, and blurred other customers.

Text prompt: The man in the image stands in a kitchen, holding a metal spatula in one hand, speaking directly to the camera, occasionally tapping or gesturing with the spatula for emphasis. The camera remains fixed. The background features a professional kitchen environment, with visible stovetops, pots, and spice racks, creating an authentic cooking scene atmosphere.

Text prompt: The character in the image is singing, wearing an elegant traditional-style dress with a gradient of blue and white. Her hair is styled in an elaborate updo adorned with blue flowers and gemstones. She gently raises her right hand, as if singing a melodious ancient-style tune. Her clear blue eyes gaze forward with a gentle and focused expression. Water droplets splash around her, while the background features hazy mountains, water, and bare branches, creating an ethereal and dreamlike atmosphere.

Text prompt: The character in the image is singing while DJing, operating the DJ controller with both hands, wearing pink headphones, and focused on adjusting the music rhythm. The background features a vibrant electronic music scene filled with neon lights, creating an energetic nightclub atmosphere.

Text prompt: The man in the image is singing while playing an electric guitar. He has long hair and a beard, wearing a black leather vest, with tattoos on his arms, and is deeply focused, immersed in the music. The stage lighting features warm yellow and red tones, with a blurred background showing the outline of a drum set, creating a lively and energetic live performance atmosphere.

Text prompt: The character in the image is singing in a recording studio, wearing professional headphones, with a microphone in front of him. He appears focused, and the background features recording equipment. The overall atmosphere is immersive and professional.

Text prompt: The man in the image is speaking directly to the camera, holding a walkie-talkie steadily with both hands. Occasionally, he lightly taps the device with his fingers or slightly adjusts his grip to match the content of his speech. The camera remains fixed. He is wearing a police uniform with a badge and shoulder patches. The background features an urban street lined with tall buildings, creating a solemn and professional atmosphere.

Text prompt: The woman in the image stands on a grassland, singing while playing the musical instrument in her hands. She wears elaborate ethnic attire, adorned with an intricately decorated headdress, and her two braids fall gracefully over her shoulders. Her expression is focused and gentle. The background features a blue sky with white clouds and rolling green hills, with sunlight bathing her, creating a serene and poetic atmosphere. The instrument she holds is a beautifully crafted stringed instrument, with finely carved body and strings that shimmer in the sunlight. Her movements are graceful, as if she is conversing with nature through her song. The entire scene is rich in ethnic charm and artistic appeal.

Text prompt: The elderly man in the image is speaking directly to the camera, holding a fishing rod naturally in his right hand, occasionally gently moving the rod to match his speech. The camera remains fixed. The background features a lush garden and a small stream, with soft sunlight and a peaceful, leisurely atmosphere.

Visualization of Soul-Bench

ID: 0129_human_talk_en_male

Tags: Male,Talking

Text prompt: The man in the image is speaking directly to the camera, with his arms crossed over his chest, and the camera remains stationary.

Talking content: I’ve spent years mastering the tools and machines that keep this workshop running. Every bolt, every gear — they’re more than parts to me; they’re a craft. I take pride in building and fixing things with my own hands. It’s not just a job; it’s a way of life. I value precision, safety, and hard work. If you need something done right, you come to me. That’s the kind of trust I earn — one project at a time. And yes, I wear those goggles not just for safety — they’re part of my identity. I’m proud to be a mechanic, and I’ll always stand by my work.

ID: 0499_comic_sing_cn_male

Tags: Male,Anime,Singing

Talking content: N/A

ID: 0285_comic_talk_en_female

Tags: Female,Anime,Talking

Talking content: 在这片金黄的草地上，每一朵雏菊都像在对我微笑。我最喜欢坐在这里，听着风穿过草叶的声音，感受阳光洒在脸上的温暖。虽然生活有时会有些小烦恼，但只要抬头看看天空，看看这些绽放的小花，我的心就会变得平静又明亮。希望你也常常停下脚步，去发现身边的美好，因为世界其实充满了温柔的奇迹。

ID: 0269_comic_talk_cn_female

Tags: Female,Talking,Anime

Talking content: 我是一位热爱传统文化的少女，喜欢在月光下绣花，听古筝的旋律。我向往宁静与美好，愿用双手传承祖先的智慧。每一片花瓣、每一针一线，都承载着我对生活的敬意。我愿与你分享东方的诗意与温柔，让这份美好如春风般拂过心田。愿世界多一份宁静，多一份对美的珍视。

ID: 0084_human_talk_en_male

Tags: Male,Talking

Talking content: Art is my voice when words fail. Every brushstroke is a piece of my soul, a silent conversation with the world. I paint not to impress, but to understand—to find meaning in the chaos. The studio is my sanctuary, where time slows and colors speak louder than language. I may not be famous, but I am true to myself. I paint for the joy of creation, for the beauty of imperfection, and for the hope that someone, someday, will see my work and feel something real. That’s enough for me.

ID: 0213_human_talk_en_male

Tags: Male,Talking

Talking content: In the kitchen, every dish is a story, and every ingredient has a voice. I believe in the power of food to connect people, to bring joy, and to express creativity. I’ve spent years mastering techniques, but I still approach each meal with curiosity and respect. Cooking is not just a job—it’s a craft, a passion, and a way of life. I’m always learning, always improving, and always striving to create something that not only tastes great but also tells a story. That’s what makes me a chef.

ID: 0191_human_talk_en_male

Tags: Male,Talking

Talking content: I am proud to wear the rich heritage of my culture. This attire isn't just clothing—it’s a story of generations, of artistry, and of identity. Every thread, every embroidery, speaks of tradition and dignity. I cherish moments like this, where I can connect with my roots and share them with the world. I believe in preserving our customs while embracing modernity. Let us honor our past, celebrate our present, and inspire the future with grace and pride.

ID: 0032_human_talk_en_male

Tags: Male,Talking

Text prompt: The man in the image is speaking directly to the camera, with a fixed shot.

Talking content: I believe in the quiet power of observation and reflection. Life moves fast, but I try to pause and truly see the world — in the details of a leaf, the expression in someone’s eyes, or the silence between words. I’m not always loud, but I’m deeply present. I find peace in nature, in music, and in the pages of a well-worn book. I’m learning to trust my own thoughts, to ask questions without needing immediate answers. I hope to live with intention, kindness, and a little bit of wonder every day.

ID: 0502_comic_sing_cn_female

Tags: Female,Anime,Singing

Talking content: N/A

ID: 0302_comic_talk_en_female

Tags: Humanoid,Talking

Text prompt: The little fox in the image is facing the camera and speaking, with big eyes full of curiosity, occasionally blinking, and the camera remains fixed.

Talking content: Hello, friend! I’m little Foxie, and I love exploring the woods and making new friends. Every day is an adventure—whether I’m chasing butterflies, finding the perfect leaf, or just sitting and watching the clouds. I’m always excited to learn something new and share my discoveries with others. If you ever need a companion on a sunny day, I’d be happy to join you! Let’s go on an adventure together!

ID: 0131_human_talk_cn_female

Tags: Female,Talking

Talking content: 在咖啡馆的暖光下，我最喜欢捧着一杯热咖啡，静静地读一本书，或者看着窗外的行人发呆。生活不需要太匆忙，慢下来才能发现那些细微的美好。我喜欢用镜头记录下每一个瞬间，也喜欢用文字表达内心的想法。或许我有点内向，但我的世界很丰富。希望你也愿意偶尔停下脚步，感受生活的温度。

ID: 0029_human_talk_en_female

Tags: Female,Talking

Text prompt: The woman in the image is speaking directly to the camera, occasionally using her right hand to gently brush her hair aside, with a fixed camera shot.

Talking content: I believe in the power of quiet strength and thoughtful action. Life is not about rushing through moments, but about truly experiencing them. I find inspiration in the details — a window’s reflection, a stranger’s smile, the way light falls on a city street. I’m passionate about creating meaning in my work and my relationships. I value authenticity and strive to live with purpose. Whether I’m capturing a moment through my lens or navigating a new city, I’m always learning, growing, and embracing the journey.

ID: 0012_human_sing_cn_female

Tags: Female, Singing

Talking content: N/A

ID: 0009_human_sing_en_male

Tags: Male, Singing

Talking content: N/A.

ID: 0085_human_talk_en_male

Tags: Male,Talking

Talking content: I've spent over 30 years building structures that stand the test of time. Every beam, every brick, every measurement matters. I take pride in my work, not just for the paycheck, but because I know I'm helping shape the future of our cities. Safety first, quality second, and trust in your team always. This job isn’t easy, but it’s rewarding. If you’re in the field, keep your head up, your tools sharp, and your heart in the work. We’re building more than buildings—we’re building communities.

ID: 0081_human_talk_en_male

Tags: Male,Talking

Talking content: There’s a quiet joy in standing by the water with a fishing rod in hand, watching the ripples and listening to the birds. Life doesn’t need to be rushed—sometimes, the best moments come when you slow down and appreciate the stillness. I’ve spent years tending my garden, fishing by the stream, and reading under the willow tree. These small routines ground me. I believe in kindness, patience, and finding peace in the ordinary. May you too find your own quiet corner of calm.

ID: 0110_human_talk_cn_female

Tags: Female,Talking

Talking content: 夜色中的城市灯火，像无数个未完成的梦想在闪烁。我站在栏杆前，穿着这件黑色旗袍，仿佛与过去与未来对话。我喜欢用镜头记录下那些转瞬即逝的美，也喜欢在舞步中释放内心的节奏。每一段旅程，每一本书，都是灵魂的滋养。我坚信，真正的优雅，来自内心的从容与对生活的热爱。愿你我都能在喧嚣中，找到属于自己的宁静与光芒。

ID: 0337_comic_sing_en_female

Tags: Female,Anime,Singing

Talking content: N/A

ID: 0145_human_talk_en_male

Tags: Male,Talking

Talking content: Consistency is the key to success. Every rep, every set, every workout brings me closer to my goals. I don’t chase perfection—I chase progress. Whether you’re just starting or you’ve been training for years, remember: small efforts compound over time. Stay committed, stay humble, and never underestimate the power of showing up. Your future self will thank you for the discipline you show today. Let’s push harder, stay strong, and keep moving forward.

ID: 0256_comic_talk_en_female

Tags: Female,Talking,Anime

Talking content: I am a young woman from the 17th century, painted by Vermeer. Though my world was quiet and simple, I found beauty in light, color, and the quiet moments of life. I wear my pearl earring not for show, but as a symbol of my inner grace. I dream of a world where art speaks louder than words, where every glance holds meaning. I hope you see not just my face, but the soul behind it — a soul that longs for connection, understanding, and the timeless power of beauty.

ID: 0309_comic_talk_en_female

Tags: Humanoid,Talking

Text prompt: The character in the image is speaking directly to the camera, with a fixed shot.

Talking content: Hello there! I’m Mimi, a fluffy British tabby with eyes like golden suns. I love lounging in the warm sunlight, watching birds from the window, and occasionally playing with my favorite feather toy. I may look serious, but I’m just deep in thought—probably about the best nap spot or the meaning of life. I’m gentle, affectionate, and enjoy quiet moments with my human. If you’re looking for a calm, cuddly companion, I’m your cat. Let’s share some quiet time together, maybe with a little purring and a lot of love.

ID: 0241_human_talk_en_male

Tags: Male,Talking

Talking content: Standing at the edge of the world, I feel the wind whispering ancient stories. Every step on this rugged path has taught me resilience and humility. The mountains don’t care about your past—they only care about your next step. I hike not to escape life, but to find myself in the silence between peaks. Each summit is a reminder: progress isn’t always measured in miles, but in the courage to keep going. To anyone chasing their own summit—breathe deep, move forward, and let the wild shape your soul.

ID: 0293_comic_talk_en_male

Tags: Anime,Talking

Talking content: Hello everyone! I’m Pikachu, and I’m so happy to be here at the funfair today! The Ferris wheel is spinning, the sky is blue, and there are so many colorful flowers around. I love to explore and make new friends. I just had a yummy snack and I’m ready for more adventures! If you see me, give me a big smile — I’ll give you a little electric zap of joy! Let’s have a wonderful day together, full of laughter and fun!

ID: 0118_human_talk_en_male

Tags: Male,Talking,Walking

Talking content: Hello everyone! I’m thrilled to be here, dressed sharp and ready to make a difference. Life is about confidence, purpose, and lifting others as we rise. Whether it’s in business, community, or personal growth, I believe in leading with integrity and a smile. Let’s chase our dreams, support one another, and build a future full of opportunity. Stay positive, stay driven, and never underestimate the power of a well-tailored suit and a kind heart. Let’s go!

ID: 0273_comic_talk_en_unknown

Tags: Humanoid,Talking

Talking content: Hi there! I’m your little orange friend, and I’m so happy to meet you! I love jumping, laughing, and making new friends. My big eyes are always looking for fun adventures, and my smile never stops! I’m full of energy and ready to explore the world with you. Let’s play, dance, and have a great time together! Don’t worry, I’m super friendly and always up for a good laugh. Come on, let’s go on an adventure!

ID: 0169_human_talk_en_male

Tags: Male,Talking

Talking content: I've spent over 25 years serving my community, and every day I remind myself that my duty is not just to enforce the law, but to protect and connect with the people I serve. I believe in justice, fairness, and building trust between the police and the public. Whether it’s helping a lost child, mediating a dispute, or standing guard during a crisis, I do it with honor and humility. I’m proud to be a part of this city’s safety and resilience. Thank you for your trust.

ID: 0015_human_talk_cn_female

Tags: Female,Talking

Talking content: 岁月静好，愿我们都能在忙碌的生活中，找到属于自己的那份宁静与从容。书页翻动的声音，台灯下温暖的光，都是生活给予的温柔馈赠。不必追逐喧嚣，内心的丰盈才是真正的富足。愿你我都能在时光中沉淀智慧，在平凡中感受美好，以温柔之心拥抱每一天。

ID: 0143_human_talk_en_female

Tags: Female,Talking

Talking content: I believe in the beauty of raw moments—like standing in the quiet of nature, letting the wind play with my hair. Life is too short to be anything but authentic. I chase inspiration in sunsets, in music, in the way light dances through leaves. I’m not afraid to be myself, even if it means standing out. I create, I explore, I feel deeply. This is my truth, and I wear it proudly. Let your soul breathe and let your light shine.

ID: 0475_human_sing_cn_female

Tags: Female,Singing

Talking content: N/A

ID: 0205_human_talk_en_male

Tags: Male,Talking

Text prompt: The man in the image is speaking directly to the camera, with a fixed shot.

Talking content: I believe strength is more than what you see on the surface—it’s in the quiet moments, in the discipline, in the way you carry yourself. I train not just to build muscle, but to build character. I capture moments through my lens because I see beauty in raw honesty. Music, especially jazz, speaks to the soul in ways words can’t. I’m not just a man of action—I’m a man of thought, feeling, and purpose. I live intentionally, and I hope to inspire others to do the same.

ID: 0247_human_talk_en_male

Tags: Male,Talking

Talking content: I’m here to celebrate the art of music and fashion! This pink cowboy suit? It’s not just a look — it’s a statement. I believe in being unapologetically yourself, no matter what the world says. Every stitch, every gold accent, every smile — it’s all about confidence and joy. I’m proud to represent boldness and creativity on this red carpet. To everyone chasing their dreams: wear your colors loud, stand tall, and never let fear dim your light. Let’s make the world brighter, one bold choice at a time.

ID: 0011_human_sing_en_male

Tags: Male, Singing

Text prompt: The man in the image is singing while playing the guitar in his hands.

Talking content: N/A

ID: 0282_comic_talk_cn_male

Tags: Humanoid,Talking

Talking content: 你好！我是一只快乐的小柴犬，喜欢每天和主人一起玩耍。我最爱在阳光下打滚，也喜欢追着小球跑来跑去。虽然有时候会调皮，但我总是很听主人的话。我有一双大大的眼睛，能看懂主人的每一个表情。希望你能喜欢我，和我一起度过每一个温暖的日子。我会用我的尾巴摇出最真诚的问候！

ID: 0152_human_talk_cn_female

Tags: Female,Talking

Talking content: 海风拂面，阳光洒在身上，我最爱的时刻就是站在沙滩上，抱着冲浪板，准备迎接海浪的挑战。每一次冲浪都像是一场与自然的对话，自由而充满力量。我热爱色彩，喜欢用鲜艳的裙子表达内心的活力。生活就像冲浪，有时平静，有时汹涌，但只要保持微笑和勇气，就能乘风破浪。希望每个人都能找到属于自己的那片海，勇敢追逐梦想。

ID: 0307_comic_talk_cn_male

Tags: Male,Anime,Talking

Talking content: 春日的暖阳洒在窗棂，樱花轻舞，如诗如画。我独坐于此，品一盏清茶，听风拂过花枝的低语。这世间纷扰，我愿以静心观之，以墨笔绘之。人生如花开花落，不必争抢，只需在属于自己的时节，绽放出最本真的色彩。愿你我皆能守一份宁静，怀一颗赤子之心，在喧嚣中寻得内心的桃花源。

ID: 0037_human_talk_cn_female

Tags: Female,Talking

Talking content: 每个人都是自己故事的主角。无论身处何地，我始终相信，内心的平静与坚定，是面对世界最好的姿态。我喜欢用镜头记录生活中的美好瞬间，也享受在旅途中遇见不同的人和文化。阅读让我在喧嚣中找到宁静，而瑜伽则帮助我与自己的身体和心灵对话。愿你也能在平凡中发现不凡，在忙碌中保持温柔。

ID: 0007_human_sing_cn_female

Tags: Female, Singing

Talking content: N/A

ID: 0413_comic_sing_en_female

Tags: Female,Anime,Singing

Talking content: N/A

ID: 0179_human_talk_en_male

Tags: Male,Talking

Text prompt: The elderly man in the image is standing in front of a lush green plant background, holding a smartphone with both hands, speaking directly to the camera. The shot is static.

Talking content: I find great joy in tending to my plants, both in my garden and in this greenhouse. With the help of my smartphone, I can monitor their growth, check for pests, and even learn new techniques. Technology and nature go hand in hand for me. I believe that staying connected with the natural world, even through digital tools, enriches our lives. Whether I'm watering a seedling or reading an article about soil health, every moment spent with plants brings me peace and purpose.

ID: 0479_comic_sing_cn_female

Tags: Female,Anime,Singing

Talking content: N/A

Comparison with SOTAs on Soul-Bench
Better to display in full screen for detailed presentation.

Soul

Sonic

Wan-S2V

InfiniteTalk

StableAvatar

OmniAvatar

ID:0057_human_talk_en_male.png

Tags:Male,Walking,Talking

Text prompt:The man in the image walks forward while speaking directly to the camera. The camera slowly pulls back, keeping him centered in the frame. He wears a black leather jacket and holds a black-and-white electric guitar, his expression focused and his steps steady. The background is an alleyway covered in vibrant graffiti, with warm yellow streetlights and neon signs flickering behind him, creating a gritty urban rock atmosphere. Occasionally, he gently strums the guitar strings with his left hand, while his right hand rests naturally on the body of the guitar, as if improvising a performance. The camera remains steady, emphasizing the interaction between the subject and his surroundings.

Talking content:I live for the raw energy of the city — the graffiti, the echoes of distant music, the way light cuts through alleyways at dusk. My guitar is my voice, my escape, my truth. I don’t follow trends; I create them. Every chord I play is a rebellion against silence. I’m not just a musician — I’m a storyteller, a wanderer, a believer in the power of sound to change minds and hearts. The world is loud, but I’m here to make it listen.

Soul

Sonic

Wan-S2V

InfiniteTalk

StableAvatar

OmniAvatar

ID:0116_human_talk_en_female.png

Tags:Female,Walking,Talking

Text prompt:The woman in the image stands in the center of an elegant hallway, speaking directly to the camera while slowly walking forward. The camera gradually pulls back, keeping her centered in the frame. She is wearing a sophisticated navy blue suit set, paired with a brown silk scarf and light-colored high heels, exuding calm confidence. The hallway features classical-style door frames and marble flooring on both sides, creating an atmosphere of refined elegance and grandeur.

Talking content:I believe confidence is the most powerful accessory one can wear. This suit isn't just clothing—it's a statement of who I am: strong, intentional, and unapologetically myself. I find beauty in structure and elegance in simplicity. Whether I'm walking down a marble hallway or navigating life’s challenges, I carry myself with purpose. I’m not just following trends—I’m creating them. And I do it with grace, intelligence, and a quiet fire that refuses to be ignored.

Soul

Sonic

Wan-S2V

InfiniteTalk

StableAvatar

OmniAvatar

ID:0007_human_sing_cn_female.png

Tags:Female,Singing

Text prompt:The woman in the image is singing, wearing light blue traditional-style clothing, adorned with an elegant diamond crown, holding a microphone, with a focused and gentle expression. The background features deep blue stage lighting, creating a dreamy atmosphere.

Talking content:N/A

Soul

Sonic

Wan-S2V

InfiniteTalk

StableAvatar

OmniAvatar

ID:0335_comic_sing_cn_male.png

Tags:Male,Anime,Singing

Text prompt:The character in the image is singing in a recording studio, wearing professional headphones, with a microphone in front of him. He appears focused, and the background features recording equipment. The overall atmosphere is immersive and professional.

Talking content:N/A

Soul

Sonic

Wan-S2V

InfiniteTalk

StableAvatar

OmniAvatar

ID:0041_human_talk_en_male.png

Tags:Male,Talking

Text prompt:The man in the image is speaking directly to the camera, with a fixed shot. The background features nighttime city lights, creating an urban atmosphere. He appears focused, wearing earrings and a dark shirt, exuding a mature and composed demeanor.

Talking content:Life is not about waiting for the storm to pass, but learning to dance in the rain. I’ve walked through cities at night, searching for meaning in the glow of distant lights. Each scar, each decision, has shaped who I am. I believe in honesty, in quiet strength, in the power of a single moment to change everything. Don’t chase perfection—chase purpose. And never forget: your story matters, even when no one’s watching.

Soul

Sonic

Wan-S2V

InfiniteTalk

StableAvatar

OmniAvatar

ID:0085_human_talk_en_male.png

Tags:Male,Talking

Text prompt:The man in the image is speaking directly to the camera, holding a yellow tape measure steadily with both hands, occasionally tapping or gesturing with the tape to emphasize his explanation. The camera remains fixed. The background features a busy construction site, with a high-rise building under construction, tower cranes, and a yellow excavator visible behind him. The scene is bathed in bright sunlight, creating a realistic and powerful atmosphere.

Talking content:I've spent over 30 years building structures that stand the test of time. Every beam, every brick, every measurement matters. I take pride in my work, not just for the paycheck, but because I know I'm helping shape the future of our cities. Safety first, quality second, and trust in your team always. This job isn’t easy, but it’s rewarding. If you’re in the field, keep your head up, your tools sharp, and your heart in the work. We’re building more than buildings—we’re building communities.

Soul

Sonic

Wan-S2V

InfiniteTalk

StableAvatar

OmniAvatar

ID:0204_human_talk_en_male.png

Tags:Male,Walking,Talking

Text prompt:The man in the image is walking forward while speaking to the camera. The camera slowly pulls back, keeping him centered in the frame. He is wearing a white hard hat and carrying a black-and-yellow toolbox. Behind him is a room under renovation, with a ladder and a level placed nearby.

Talking content:I’m proud to be part of the transformation of this space. Every tool, every measurement, and every decision matters. I love the process of turning raw potential into something beautiful and functional. Whether it’s a new wall, a fresh coat of paint, or a reimagined layout, I take pride in doing it right. This isn’t just construction—it’s creation. And I’m excited to see the final result. Safety first, always. Let’s build something great.

Soul

Sonic

Wan-S2V

InfiniteTalk

StableAvatar

OmniAvatar

ID:0428_human_sing_cn_female.png

Tags:Male,Singing

Text prompt:The character in the image is singing, with a soft and blurred background. She softly hums a cheerful pop song, her eyes bright and full of emotion, as if sharing a beautiful moment with the audience. She wears pearl earrings and a pearl necklace, dressed elegantly and refined. The overall atmosphere is warm and dreamy, as if sitting by a sunlit window, conveying warmth and hope through her singing.

Talking content:N/A

Soul

Sonic

Wan-S2V

InfiniteTalk

StableAvatar

OmniAvatar

ID:0327_comic_sing_cn_female.png

Tags:Female,Anime,Singing

Text prompt:The character in the image is singing while DJing, operating the DJ controller with both hands, wearing pink headphones, and focused on adjusting the music rhythm. The background features a vibrant electronic music scene filled with neon lights, creating an energetic nightclub atmosphere.

Talking content:N/A

Soul

Sonic

Wan-S2V

InfiniteTalk

StableAvatar

OmniAvatar

ID:0125_human_talk_en_male.png

Tags:Male,Talking

Text prompt:The man in the frame is speaking directly to the camera, smiling while adjusting his bow tie with one hand. His right hand hangs naturally by his side, while his left hand holds a white suit jacket draped over his shoulder. A white boutonniere is pinned to the jacket. The camera remains stationary.

Talking content:Today is a beautiful day filled with joy, love, and celebration. I’m honored to be part of this special occasion, dressed in my finest, ready to share in the happiness of the couple. Life is too short to not celebrate love, and I’m here to embrace every moment with a smile. May this day be the beginning of a lifetime of happiness, laughter, and unforgettable memories. Let’s dance, laugh, and cherish every second together!

Soul

Sonic

Wan-S2V

InfiniteTalk

StableAvatar

OmniAvatar

ID:0273_comic_talk_en_unknown.png

Tags:Humanoid,Talking

Text prompt:The character in the image is speaking directly to the camera, occasionally making gestures that match the content of their speech, with a fixed camera angle. This fluffy orange little monster is grinning widely, showing neat teeth, with round eyes blinking expressively. Its hands hang naturally at its sides, occasionally lifting excitedly to make a "V" sign or gently patting its little chest, appearing lively and adorable. The background is a warm orange-yellow, emphasizing the character's fluffy texture and vibrant colors.

Talking content:Hi there! I’m your little orange friend, and I’m so happy to meet you! I love jumping, laughing, and making new friends. My big eyes are always looking for fun adventures, and my smile never stops! I’m full of energy and ready to explore the world with you. Let’s play, dance, and have a great time together! Don’t worry, I’m super friendly and always up for a good laugh. Come on, let’s go on an adventure!

Soul

Sonic

Wan-S2V

InfiniteTalk

StableAvatar

OmniAvatar

ID:0029_human_talk_en_female.png

Tags:Female,Talking

Text prompt:The woman in the image is speaking directly to the camera, occasionally using her right hand to gently brush her hair aside, with a fixed camera shot.

Talking content:I believe in the power of quiet strength and thoughtful action. Life is not about rushing through moments, but about truly experiencing them. I find inspiration in the details — a window’s reflection, a stranger’s smile, the way light falls on a city street. I’m passionate about creating meaning in my work and my relationships. I value authenticity and strive to live with purpose. Whether I’m capturing a moment through my lens or navigating a new city, I’m always learning, growing, and embracing the journey.

Soul

Sonic

Wan-S2V

InfiniteTalk

StableAvatar

OmniAvatar

ID:0036_human_talk_en_male.png

Tags:Male,Talking

Text prompt:The man in the image is standing in a bright office, speaking directly to the camera. He holds a pen in his right hand and wears a silver watch on his left wrist, occasionally using the pen to gesture in sync with his speech. The camera remains fixed. The background features floor-to-ceiling windows and a bookshelf, with city buildings visible outside. Inside, there are green plants and a desk, creating an overall professional and composed atmosphere.

Talking content:Success isn't about luck—it's about preparation, consistency, and integrity. Every decision I make is rooted in long-term vision, not short-term gain. I value precision, whether it's in my work or my personal life. I believe in leading by example, mentoring others, and staying committed to excellence. Challenges are not obstacles—they're opportunities to grow. Stay focused, stay ethical, and never underestimate the power of a well-planned strategy. That’s how you build lasting impact.

Soul

Sonic

Wan-S2V

InfiniteTalk

StableAvatar

OmniAvatar

ID:0075_human_talk_cn_female.png

Tags:Female,Talking

Text prompt:The woman in the image is speaking directly to the camera, holding a bouquet of fresh flowers. Occasionally, she gently taps the bouquet with her fingers or slightly turns her head. The camera remains stationary.

Talking content:生活中最美的瞬间，往往藏在那些不经意的细节里。一朵花的绽放，一缕阳光的洒落，都能让我感受到世界的温柔。我喜欢用镜头记录下这些美好，也喜欢用双手为他人编织花束，传递一份心意。希望每个人都能在忙碌中停下脚步，发现身边的诗意与温暖。愿你我都能像花一样，不争不抢，却自有芬芳。

Soul

Sonic

Wan-S2V

InfiniteTalk

StableAvatar

OmniAvatar

ID:0457_human_sing_cn_female.png

Tags:Female,Singing

Text prompt:The character in the image sings while playing the musical instrument in her hands, wearing elaborate ethnic attire, adorned with intricate silver jewelry and a colorful furry headdress, with two braids hanging over her shoulders. The background features vast grasslands under a bright blue sky with white clouds, with sunlight shining upon her, creating a dreamy and richly ethnic atmosphere.

Talking content:N/A

Soul

Sonic

Wan-S2V

InfiniteTalk

StableAvatar

OmniAvatar

ID:0464_human_sing_cn_female.png

Tags:Female,Singing

Text prompt:The character in the image is singing, wearing an elegant diamond tiara, a white gown, and exquisite necklace and earrings, holding a microphone, with a focused and gentle expression, as if passionately performing on stage.

Talking content:N/A

Leaderboard on Soul-Bench

More results of the latest methods will be kept updated.

ID	Method	Video-Text Consistence↑	LSE-D↓	LSE-C↑	Identity Consistence↑	Video Quality↑	Audio-Video Alignment↑
1	Soul	4.85	0.130	6.82	0.763	72.60	0.255
2	Sonic	4.57	0.663	7.80	0.613	68.58	0.191
3	Wan-S2V	4.74	5.455	6.71	0.750	71.22	0.330
4	InfiniteTalk	4.75	2.313	8.48	0.609	68.53	0.211
5	StableAvatar	4.77	3.948	4.05	0.733	71.40	0.250
6	OmniAvatar	4.77	1.009	5.84	0.497	67.24	0.225

BibTeX

@misc{soul, title={Soul: Breathe Life into Digital Human for High-fidelity Long-term Multimodal Animation}, author={Jiangning Zhang and Junwei Zhu and Zhenye Gan and Donghao Luo and Chuming Lin and Feifan Xu and Xu Peng and Jianlong Hu and Yuansen Liu and Yijia Hong and Weijian Cao and Han Feng and Xu Chen and Chencan Fu and Keke He and Xiaobin Hu and Chengjie Wang}, year={2025}, eprint={2512.13495}, archivePrefix={arXiv}, primaryClass={cs.CV}, url={https://arxiv.org/abs/2512.13495}, }

Soul: Breathe Life into Digital Human for
High-fidelity Long-term Multimodal Animation

High-lights

Brief introduction for Soul, Soul-1M, and Soul-Bench.

Enjoy animation results of Soul on Soul-Bench

Visualization of Soul-Bench

Comparison with SOTAs on Soul-Bench
Better to display in full screen for detailed presentation.

Leaderboard on Soul-Bench

Structure of Soul

Statistic of Soul-1M

Statistic of Soul-Bench

Comparison with SOTAs

BibTeX

Soul: Breathe Life into Digital Human for High-fidelity Long-term Multimodal Animation

High-lights

Brief introduction for Soul, Soul-1M, and Soul-Bench.

Enjoy animation results of Soul on Soul-Bench

Visualization of Soul-Bench

Comparison with SOTAs on Soul-Bench Better to display in full screen for detailed presentation.

Leaderboard on Soul-Bench

Structure of Soul

Statistic of Soul-1M

Statistic of Soul-Bench

Comparison with SOTAs

BibTeX

Soul: Breathe Life into Digital Human for
High-fidelity Long-term Multimodal Animation

Comparison with SOTAs on Soul-Bench
Better to display in full screen for detailed presentation.