Skip to content

[HANDS-ON BUG] #517

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
micdestefano opened this issue May 28, 2025 · 0 comments
Open

[HANDS-ON BUG] #517

micdestefano opened this issue May 28, 2025 · 0 comments

Comments

@micdestefano
Copy link

Describe the bug
The correct answer to the question on the youtube video with penguins and seagull is 2, but the automatic verification marks it as wrong.

To Reproduce
Try submitting 2 for the question on the youtube video with penguins.

Additional context
The qustion asks how many different bird species can be seen at the same time. The correct answer is 2, because we can at most see penguins and a seagull at the same time. The fact that in some frames we see small penguins and an adult penguin (probably of another race, but for sure not of another species ... because it is a penguin too) together with a seagull does not imply that the correct answer is 3. My agent is answering 2 and I'm convinced that this is the correct answer.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant