AInvest Newsletter
Daily stocks & crypto headlines, free to your inbox
Alphabet Inc., the parent company of Google, has announced the full release of a new feature for its Gemini AI assistant that lets paid users convert photos into short videos. The feature, initially tested on a small scale earlier this year, is now fully integrated into the Gemini chat interface. Users can generate an 8-second video with sound from a single photo and a text description. The resulting video is in MP4 format with a resolution of 720p and a 16:9 aspect ratio.
This update represents a significant enhancement to Gemini's capabilities, leveraging advanced AI to transform static images into dynamic content. Integrating the feature directly into the Gemini chat interface streamlines the user experience, making it easier for paid subscribers to create engaging video content from their photos. The move underscores Google's commitment to expanding the functionality of its AI tools and giving users more creative options.
The introduction of this feature is part of a broader strategy by Google to enhance its AI offerings and stay competitive in the rapidly evolving tech landscape. By allowing users to convert photos into videos, Google is tapping into the growing demand for multimedia content, which is increasingly popular across various social media platforms. This feature not only adds value for paid users but also positions Gemini as a versatile tool for content creation, potentially attracting more users to its platform.
The ability to generate videos with sound from photos opens up new possibilities for users, whether they are content creators, marketers, or individuals looking to enhance their personal projects. The 8-second duration and 720p resolution make the videos suitable for sharing on social media, which makes the feature a practical tool for a wide range of applications. The 16:9 aspect ratio matches the widescreen format used by most displays and video platforms.
This update is a testament to Google's ongoing efforts to innovate and improve its AI technologies. By continuously adding new features and enhancing existing ones, Google is ensuring that its AI tools remain at the forefront of technological advancements. The integration of the photo-to-video conversion feature into the Gemini chat interface is a strategic move that not only benefits users but also reinforces Google's position as a leader in AI development. As AI continues to play a crucial role in various industries, Google's commitment to innovation will be key to its success in the long run.
Google has emphasized that it has taken important backend measures to ensure that the generated videos comply with regulations. For example, images of public figures, including celebrities, politicians, and prominent entrepreneurs, may not be used for video generation. The policy also prohibits content that incites dangerous behavior, violence, or group attacks. However, testing has shown that the technology still has some flaws. Media testing on the Gemini web version found that when testers uploaded personal photos to generate videos of people speaking, the output often changed facial features and even ethnicity. While simple commands like "plants swaying in the wind" or "static cat talking" were executed successfully, more complex commands like "photo person doing a backflip" only resulted in the person waving their hands.
In response to the test results, a Google spokesperson stated that the AI model is not instructed to modify a person's appearance. Photo-to-video conversion and facial animation are still new technologies, and because the model works from a single image, its results may not match the original content. The model is more adept at animating other kinds of scenes, such as everyday objects, artwork, and nature photos with added motion effects. The company will continue to improve various functions, including facial animation, in future updates.
