ml2 logo

NYU Labeling task on Hybrid

QQP FAQ

Instructions for QQP can be found here!

Thanks for doing our HITs! With your help, we think we’ll be able to build some pretty exciting technologies to help computers better understand human language.

Why is there a training phase?

Sometimes this task can be tricky and we want you to get a sense of what the task is before you work on the main project. We already have labels for these examples, and what we’re doing here is gathering data to understand how well people perform on the task when they have some instructuions and a little bit of training.

Can I immediately start working on the main project after completing training?

Unfortunately no, there is no automatic way for us to add you to our qualified list of workers. We go through the submitted HITs on the training task at least once a day and add worker IDs to the qualified list. Once your name is on the list, you will be notified thruugh Hyrbid and will be able to start working on the main project.

Should the not same label be more common than same?

Ideally, yes. We already have labels for all of these pairs, and we know that there are more not same labeled pairs. So if you find yourself assigning slightly more not same labels, don’t be alarmed. If your responses skew more towards same, reconsider how you are evaluating the prompts.

Will you reject my work?

No. Unless it’s clear to us that you are assigning labels across many HITs without even considering the prompts, we won’t reject any of your work.

Where do these question pairs come from?

Each pair of sentences was written by two different people on an online question-answering forum.

When should I fill out the ‘problems’ field?

You should fill this out if you can’t complete the HIT, and not otherwise. This could be if the HIT interface is partially broken (an empty page, for example). If there is a typo in a sentence, but you think you know what it means anyway, please don’t report it. Never put anything in this field if there isn’t a problem.

Who are you?

We are busy graduate students with the Bowman Group, a subgroup of the ML2 group at New York University Center for Data Science. We are also affiliated with the NYU Departments of Data Science, Computer Science, and Linguistics.

I have more questions!

Email us through Crystal (crystal.phoenixsystems@gmail.com)! Or email through Hybrid!