Very interesting (not for a dashcam, but for home monitoring).
Today I learned that Gemini can now natively embed video.
Cool Project, thanks for sharing!
Where is the Exit to this dystopia?
That's quite interesting, well done! I hadn't thought of this use case for embeddings. It opens the door to quite a few potential applications!
Man, the surveillance applications for this are staggering.
Nice use of native video embedding. How do you handle cases where Gemini's response confidence is low? Do you have a fallback or threshold?
no threshold as of now, but one is planned.
for example, if i search "cybertruck" in my indexed dashcam footage right now, there are no cybertrucks in it, so it returns the next best match: a clip of a big truck that isn't a cybertruck
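to make the threshold idea concrete, here's a minimal sketch of what it could look like. this is not the project's actual code: the `search` function, the `min_score` parameter, the 0.7 cutoff and the tiny 3-dimensional vectors are all made up for illustration, and it assumes clips are ranked by cosine similarity against the query embedding.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def search(query_vec, index, min_score=0.5):
    """Return (clip_name, score) for the best match, or None if the best
    match scores below min_score. `index` is a list of (clip_name, vector).
    The min_score default is an arbitrary placeholder, not a tuned value."""
    if not index:
        return None
    best = max(
        ((name, cosine(query_vec, vec)) for name, vec in index),
        key=lambda pair: pair[1],
    )
    return best if best[1] >= min_score else None

# Toy index: hand-written vectors standing in for real clip embeddings.
index = [
    ("big_truck.mp4", [0.9, 0.1, 0.0]),
    ("sedan.mp4", [0.1, 0.9, 0.0]),
]

# Hypothetical "cybertruck" query that only partly resembles the big truck.
cybertruck_query = [0.6, 0.0, 0.8]

# No threshold: the next best match comes back even though it is a weak one.
print(search(cybertruck_query, index, min_score=0.0))

# With a threshold: the weak match is rejected and the search returns nothing.
print(search(cybertruck_query, index, min_score=0.7))
```

a usable cutoff would have to be picked by looking at real scores for queries that do and don't have a true match in the footage, since the score ranges depend on the embedding model.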
very cool, anybody have apparent use cases for this?
dashcam and home security footage are the 2 main ones i can think of.
it's a bit expensive right now, so it's not that practical at scale. but once the embedding model comes out of public preview, and we hopefully get a local equivalent, this will be a lot more practical.
State surveillance
why not skip the text conversion? is it usable at all?
gemini embedding 2 converts video straight to vectors, so there's no text step to skip. in this case, dashcam clips don't have audio to transcribe, and even if they did, the transcript would be useless for the search
What are the SOTA audio models right now?