A Google natív képgenerálást tett elérhetővé a Gemini 2.0 Flash fejlesztőknek

2025. március 13. · MI Történik? · 1 perc olvasás

Google has released native image generation capabilities specifically for developers. Unlike traditional tools that call a separate model like DALL-E or Imagen, native generation allows the same model to generate both text and images. This integration enables features like natural and selective editing, where specific parts of an image can be modified without altering the entire composition.

Access it via Google AI Studio by selecting the 'Gemini 2.0 Flash Experimental' model
Enables natural and selective editing of images without changing the whole picture
Google AI Studio now also supports asking questions about YouTube videos directly rather than just the transcript

Miért fontos?

Native multimodal generation makes models feel less 'dumb' by allowing them to handle text and imagery within the same context, leading to more precise editing and creative control.

Eredeti forrás megtekintése (angol) →

Kapcsolódó hírek

Az OpenAI új eszközöket és SDK-t mutatott be autonóm AI ágensek építéséhez

2025. március 13.

Az a16z közzétette a 100 legnépszerűbb fogyasztói AI alkalmazás negyedik kiadását

2025. március 11.

A Mistral új modellt adott ki a hagyományos OCR rendszerek kiváltására

2025. március 11.

Tudj meg többet

Gemini a Gmail-ben és a Google Docs-ban: Így automatizáld a munkád

Gemini AI: A Google mesterséges intelligenciája közérthetően