Granular Analysis of Transformer
I am writing this post to record some widely observed conclusions about the Transformer network [1]:
In addition to the six challenges listed in [1], here are other well-known problems with neural machine translation:
Paper writing is a crucial part of research, and I’m pretty bad at it. While revising my first paper, I learned a lot (mostly from my advisor). My takeaways are as follows.
These are my first notes on differential geometry, based on the book “First Steps in Differential Geometry”. They cover only the linear algebra part. Note: the book is pretty succinct in its linear algebra part, so I’ll try to work through it on my own.
This is my first time writing a blog. Hello world.