Sequence Models Coursera Week 2 Quiz Answers
Quiz - Natural Language Processing & Word Embeddings
1. Suppose you learn a word embedding for a vocabulary of 10000 words. Then the embedding vectors could be 10000 dimensional, so as to capture the full range of variation and meaning in those words.
- True
- False
2. What is t-SNE?
- A supervised learning algorithm for learning word embeddings
- A linear transformation that allows us to solve analogies on word vectors
- A non-linear dimensionality reduction technique
- An open-source sequence modeling library
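As a quick illustration of the answer to question 2, here is a minimal sketch (assuming scikit-learn and matplotlib, with random stand-in vectors instead of real embeddings) of using t-SNE as a non-linear dimensionality reduction to visualize word embeddings in 2D:

```python
import numpy as np
from sklearn.manifold import TSNE
import matplotlib.pyplot as plt

words = ["king", "queen", "man", "woman", "apple", "orange"]   # hypothetical vocabulary
rng = np.random.default_rng(0)
embeddings = rng.normal(size=(len(words), 300))                # stand-in for real 300-d embeddings

# t-SNE is a non-linear dimensionality reduction technique: it maps the
# high-dimensional vectors to 2D while trying to preserve local neighborhoods.
coords = TSNE(n_components=2, perplexity=3, random_state=0).fit_transform(embeddings)

plt.scatter(coords[:, 0], coords[:, 1])
for word, (x, y) in zip(words, coords):
    plt.annotate(word, (x, y))
plt.show()
```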
3. Suppose you download a pre-trained word embedding which has been trained on a huge corpus of text. You then use this word embedding to train an RNN for a language task of recognizing if someone is happy from a short snippet of text, using a small training set.
True/False: Then even if the word “upset” does not appear in your small training set, your RNN might reasonably be expected to recognize “I’m upset” as deserving a label y = 0.
- False
- True
4. Which of these equations do you think should hold for a good word embedding? (Check all that apply)
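The equations in question 4 are analogy relations between embedding vectors. A minimal sketch of how such a relation is usually checked, using hypothetical random vectors and cosine similarity (real trained embeddings would score close to 1 on a valid analogy):

```python
import numpy as np

def cosine(u, v):
    # cosine similarity between two vectors
    return np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v))

# Stand-in embeddings; in practice these come from a trained embedding matrix.
rng = np.random.default_rng(0)
e = {w: rng.normal(size=50) for w in ["man", "woman", "king", "queen"]}

# Check e_man - e_woman ≈ e_king - e_queen by comparing the difference vectors.
lhs = e["man"] - e["woman"]
rhs = e["king"] - e["queen"]
print(cosine(lhs, rhs))   # close to 1 for a good embedding, near 0 for random vectors
```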
5. Let A be an embedding matrix, and let o_4567 be the one-hot vector corresponding to word 4567. Then to get the embedding of word 4567, why don't we call A * o_4567 in Python?
- It is computationally wasteful.
- The correct formula is A^T * o_4567
- None of the answers are correct: calling the Python snippet as described above is fine.
- This doesn’t handle unknown words (<UNK>).
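A minimal sketch of why the matrix-vector product in question 5 is computationally wasteful (hypothetical sizes; both lines recover the same embedding, but one multiplies through thousands of zeros while the other just slices a column):

```python
import numpy as np

vocab_size, emb_dim = 10000, 100
A = np.random.default_rng(0).normal(size=(emb_dim, vocab_size))  # embedding matrix
o_4567 = np.zeros(vocab_size)
o_4567[4567] = 1.0                                               # one-hot vector for word 4567

emb_slow = A @ o_4567      # O(emb_dim * vocab_size) multiply-adds, almost all against zeros
emb_fast = A[:, 4567]      # O(emb_dim) column lookup
print(np.allclose(emb_slow, emb_fast))   # True: same result, far less work
```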
6. When learning word embeddings, we pick a given word and try to predict its surrounding words or vice versa.
- True
- False
7. In the word2vec algorithm, you estimate P(t | c), where t is the target word and c is a context word. How are t and c chosen from the training set? Pick the best answer.
- c is the one word that comes immediately before t
- c and t are chosen to be nearby words.
- c is the sequence of all the words in the sentence before t
- c is a sequence of several words immediately before t
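A minimal sketch of how the context/target pairs in question 7 are typically drawn, assuming a small sampling window (the window size here is just an illustration):

```python
import random

sentence = "i want a glass of orange juice to go along with my cereal".split()
window = 4   # assumed window size: target words are drawn from a few words around the context word

def sample_pair(tokens):
    i = random.randrange(len(tokens))                        # pick the context word c
    offsets = [d for d in range(-window, window + 1)
               if d != 0 and 0 <= i + d < len(tokens)]        # valid nearby positions
    j = i + random.choice(offsets)                            # pick a nearby target word t
    return tokens[i], tokens[j]

print(sample_pair(sentence))   # e.g. ('orange', 'juice')
```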
8. Suppose you have a 10000 word vocabulary, and are learning 100-dimensional word embeddings. The word2vec model uses the following softmax function:

P(t | c) = exp(θ_t^T e_c) / Σ_{t'=1}^{10000} exp(θ_{t'}^T e_c)

True/False: After training, we should expect θ_t to be very close to e_c when t and c are the same word.
- False
- True
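A minimal sketch of the softmax from question 8, with hypothetical random parameters, computing P(t | c) over the whole vocabulary for one context word:

```python
import numpy as np

vocab_size, emb_dim = 10000, 100
rng = np.random.default_rng(0)
theta = rng.normal(size=(vocab_size, emb_dim))   # one theta_t per target word
E = rng.normal(size=(vocab_size, emb_dim))       # one embedding e_c per context word

c = 4567                                         # index of the context word
scores = theta @ E[c]                            # theta_t^T e_c for every target word t
scores -= scores.max()                           # shift for numerical stability
p = np.exp(scores) / np.exp(scores).sum()        # P(t | c) over the whole vocabulary
print(p.shape, p.sum())                          # (10000,), 1.0
```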
9. Suppose you have a 10000 word vocabulary, and are learning 500-dimensional word embeddings. The GloVe model minimizes this objective:

min Σ_{i=1}^{10000} Σ_{j=1}^{10000} f(X_ij) (θ_i^T e_j + b_i + b_j' - log X_ij)^2

True/False: X_ij is the number of times word j appears in the context of word i.
- True
- False
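A minimal sketch of the GloVe objective from question 9, using toy sizes and hypothetical random parameters and counts (the quiz uses a 10000 word vocabulary and 500 dimensions):

```python
import numpy as np

vocab_size, emb_dim = 200, 50   # toy sizes so the sketch runs instantly
rng = np.random.default_rng(0)
X = rng.poisson(0.01, size=(vocab_size, vocab_size))   # X_ij: times word j appears in the context of word i
theta = rng.normal(size=(vocab_size, emb_dim))
E = rng.normal(size=(vocab_size, emb_dim))
b, b_prime = rng.normal(size=vocab_size), rng.normal(size=vocab_size)

def f(x, x_max=100, alpha=0.75):
    # weighting term: zero for zero counts, capped for very frequent pairs
    return np.where(x > 0, np.minimum(x / x_max, 1.0) ** alpha, 0.0)

# (theta_i^T e_j + b_i + b'_j - log X_ij)^2, weighted by f(X_ij) and summed over i, j.
residual = theta @ E.T + b[:, None] + b_prime[None, :] - np.log(np.where(X > 0, X, 1))
objective = np.sum(f(X) * residual ** 2)
print(objective)
```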
10. You have trained word embeddings using a text dataset of m1 words. You are considering using these word embeddings for a language task, for which you have a separate labeled dataset of m2 words. Keeping in mind that using word embeddings is a form of transfer learning, under which of these circumstances would you expect the word embeddings to be helpful?
- m1 << m2
- m1 >> m2