所有标签

#machine learning ¹ #solutions ⁶⁸ #d2l ⁶⁸ #pytorch ⁶⁸ #异常解决 ⁷ #analysis ³ #risk management ¹ #vps ⁶ #羊毛 ³

15.7. Word Similarity and Analogy

github: https://github.com/pandalabme/d2l/tree/main/exercises 1. Test the fastText results using TokenEmbedding(‘wiki.en’). import os import torch fro

narcissuskid 发布于 2023-11-30

Deep Learning #solutions #d2l #pytorch

15.5. Word Embedding with Global Vectors (GloVe)

github: https://github.com/pandalabme/d2l/tree/main/exercises 1. If words w_i and w_j co-

narcissuskid 发布于 2023-11-30

Deep Learning #solutions #d2l #pytorch

15.6. Subword Embedding

github: https://github.com/pandalabme/d2l/tree/main/exercises 1. As an example, there are about 3\times 10^8 possible 6-grams in English. What is the

narcissuskid 发布于 2023-11-30

Deep Learning #solutions #d2l #pytorch

15.4. Pretraining word2vec

github: https://github.com/pandalabme/d2l/tree/main/exercises 1. Using the trained model, find semantically similar words for other input words. Can y

narcissuskid 发布于 2023-10-19

Deep Learning #solutions #d2l #pytorch

15.3. The Dataset for Pretraining Word Embeddings

github: https://github.com/pandalabme/d2l/tree/main/exercises 1. How does the running time of code in this section changes if not using subsampling? i

narcissuskid 发布于 2023-09-27

Deep Learning #solutions #d2l #pytorch

15.2. Approximate Training

github: https://github.com/pandalabme/d2l/tree/main/exercises 1. How can we sample noise words in negative sampling? Negative sampling is a technique

narcissuskid 发布于 2023-09-27

Deep Learning #solutions #d2l #pytorch

15.1. Word Embedding (word2vec)

github: https://github.com/pandalabme/d2l/tree/main/exercises 1. What is the computational complexity for calculating each gradient? What could be the

narcissuskid 发布于 2023-09-27

Deep Learning #solutions #d2l #pytorch

11.9. Large-Scale Pretraining with Transformers

github: https://github.com/pandalabme/d2l/tree/main/exercises 1. Is it possible to fine-tune T5 using a minibatch consisting of different tasks? Why o

narcissuskid 发布于 2023-09-11

Deep Learning #solutions #d2l #pytorch

11.8. Transformers for Vision

github: https://github.com/pandalabme/d2l/tree/main/exercises 1. How does the value of img_size affect training time? The value of img_size affects th

narcissuskid 发布于 2023-09-11

Deep Learning #solutions #d2l #pytorch

11.7. The Transformer Architecture

github: https://github.com/pandalabme/d2l/tree/main/exercises 1. Train a deeper Transformer in the experiments. How does it affect the training speed

narcissuskid 发布于 2023-09-11

菜单

Different Perspective of Line Regression

所有标签

15.7. Word Similarity and Analogy

15.5. Word Embedding with Global Vectors (GloVe)

15.6. Subword Embedding

15.4. Pretraining word2vec

15.3. The Dataset for Pretraining Word Embeddings

15.2. Approximate Training

15.1. Word Embedding (word2vec)

11.9. Large-Scale Pretraining with Transformers

11.8. Transformers for Vision

11.7. The Transformer Architecture

Different Perspective of Line Regression