Intro to Oshuur: AI Splintershorts | Splinterlands

avatar
(Edited)

image.png


For my second try at doing a video entirely powered by AI, I decided to go with a card introduction, and went with one of my all-time favorite @splinterlands cards: Oshuur Constantia. As before, here's the final result before going into my process:

Process

This video was a little bit simpler than the first, as my "storyboard" required less action. I decided to do this because I realized that while Runway is a very powerful AI video tool, it's still quite difficult to create precise and high-quality motion (or at least it's not something I've quite figured out). Also, I saw Runway had a "lip sync" feature and wanted to try it out.

The steps were as follows:
(1) Creating high fidelity images in Midjourney
(2) Generating audio in Runway (for lip sync & non lip sync parts)
(3) Lip syncing audio to image to create a speaking video in Runway
(4) Generating other videos in Runway
(5) Stitching everything together in Clipchamp

Midjourney

As before, I used Midjourney to create my images. This time I skipped ChatGPT and went straight to Midjourney, using the character reference feature to get MJ's help in generating high-fidelity images that closely match the original art.

image.png


One new thing I discovered was that using the "--cw" parameter, which determines the character weight, it can make it difficult to make a photorealistic rendering of the character if the reference image is an illustration. This can be balanced by adding a photorealistic image as a secondary "style" reference (the paperclip, instead of the person icon). The "style" weight parameter is "--iw" and goes from 0 to 3, with a default value of 1.

Ultimately however I was able to get the images I wanted by doing a series of experiments and occasionally alternating my character reference image in the prompt. Keeping "--cw 100" (100 is the max for character reference) ensures that MJ maintain consistency with the original image (style, clothing, colors, etc.).

image.png

For the image used for the lip sync, I created an image where Oshuur was looking straight at the camera, as my attempts to add lip movements to other images (e.g. where she's slightly looking to the side) failed miserably.

Creating the audio & videos

Runway has a "Generative Audio" feature that allows you to select from a library of voices. You simply provide a script, create generate, and it does the rest. The voices are extremely natural and lifelike.

image.png

image.png


Lip-syncing is then quite simple, and just requires adding the image you want to animate (in my case the one where Oshuur is staring right into the camera) and selecting the audio file you just created.

image.png

After this, I went on to create a few more videos using alternate art for the frames where Oshuur is speaking in the background.

image.png

As with the last video, I used Runway's Gen-3 Alpha Turbo model which does a good job of quickly create high fidelity videos from the provided images.

Stitching it together in Clipchamp

As before, I opted to use Clipchamp which is simple and easy to use. With it I'm able to quickly put together the image, video, and audio files together.

image.png

Conclusion & Resources

This video took me a total of about 2-2.5 hours. It would've been much less (about an hour) but at the last minute I decided to add extra frames where Oshuur describes her abilities, and also ran into an issue with the audio (described below). Overall though it was easier because there was considerably less motion and a much simpler storyboard.

Resources used:

  • Midjourney to create consistent high fidelity images (https://www.midjourney.com/ )
  • Runway to animate the scenes (https://app.runwayml.com/ )
  • Clipchamp to stitch the video together
  • Youtube music library for royalty-free music (https://www.youtube.com/audiolibrary ); Unfortunately while attempting to use music from Pixabay, even though it was marked as "royalty-free", it triggered the copyright check on YouTube, so I had to switch out the track I originally had with one from the Youtube music library (which isn't bad, but is rather limited)

Hope you enjoyed this post and the resulting video. Let me know if you have ideas for other videos you think I should try to do, and let me know if you find this one interesting & useful.



0
0
0.000
28 comments
avatar

So rad dude. Can't quite put my finger on which nostalgic heart strings this video is tuggin on, but boy it's beautiful. Another one well done, BT!

avatar

Ha, that's awesome dude. Chrono Trigger perhaps, what with the steampunk vibe? Or one of the early Final Fantasy games? Those are kind of what I felt after setting the music and watching the finished product.

Thanks for dropping by TPG!

avatar

FF vibes were definitely in there - the music is what really took it home for me. I think half of the views on YT are me! Haha. I really love what ya dabbling into here with this pursuit of AI greatness, Tof! Love it man.

avatar

Here's the full song if you'd like to play it in the background without Oshuur talking :)

avatar

Congratulations @bravetofu! You have completed the following achievement on the Hive blockchain And have been rewarded with New badge(s)

You made more than 600 comments.
Your next target is to reach 700 comments.

You can view your badges on your board and compare yourself to others in the Ranking
If you no longer want to receive notifications, reply to this comment with the word STOP

avatar

Very cool! You should try to generate a video that promotes Splinterlands using AI more directly :)

avatar
(Edited)

Um.... wow. Next level dude! Seriously wow, I'm blown away. This is amazing and I'm so excited to see what else you can do. Like for real, most of the AI art we've been seeing lately does not look super awesome. THIS looks like 99% super awesome. Like I almost feel like there are moments when it looks on par with some of my favorite current shows, so like dude, yeah. This is next level. Awesome job! I'll be excited to see more and I also followed your youtube channel!

avatar

Thanks @star-shroud, I'm glad you like it! It's a lot of fun to make these. Definitely a lot of kinks in the process that I'm still figuring out, but it's a fun journey :)

avatar
(Edited)

For Pixaby sound effect, dont even worry! Youtube WILL flag it, but still let you use it. Promise. I use pixaby like its my own garage of sounds! It's the best ! Yes youtube will flag, but they just say "this is copyrighted" to tell you. There's no penalty and you can still post it! Pixaby is amazing.

avatar

Haha ok I'll try it again next time!

avatar

I'm excited to look at "Runway" and see how you did your animation, its really really good!

avatar

Good luck and looking forward to see what you make with it!

avatar

Fun video, thanks for sharing. I assumed that Oshuur had a deeper, more commanding voice. Must have confused her for Kelya.

avatar

Ha, yeah I tried 6 or 7 different voices, but for some reason this one stuck with me. I wouldn't have made it quite so high pitched but the tone was what felt the most accurate.

avatar

Tone is the most important... spoken like a true teacher!

avatar

Excellent. This is one mosaic which new players need in SPL. Awesome job, bro!

avatar

Oh otro diseñador que usa I.A e ingeniera en imagenes. Te felicito mi hermano, muy buen trabajo.