My favorite test is to ask the image model to do an overhead view of a baseball field and put an apple in left field near the warning track. Have yet to find one that can do it. Not to mention if you ask it to move the apple from - usually center - to the right side of the image. I've found nothing that can manipulate a specific object i…
My favorite test is to ask the image model to do an overhead view of a baseball field and put an apple in left field near the warning track. Have yet to find one that can do it. Not to mention if you ask it to move the apple from - usually center - to the right side of the image. I've found nothing that can manipulate a specific object in the first prompt. Text can be rough. Images can be just junk.
My favorite test is to ask the image model to do an overhead view of a baseball field and put an apple in left field near the warning track. Have yet to find one that can do it. Not to mention if you ask it to move the apple from - usually center - to the right side of the image. I've found nothing that can manipulate a specific object in the first prompt. Text can be rough. Images can be just junk.
With that in mind, how do you think video does? Audio is trying, but.....