Large-scale pre-trained models (PTMs), such as Transformer models, have advanced deep learning (DL) on a variety of complex tasks, including natural language processing (e.g., BERT [9], GPT [6], T5 [41]), computer vision (e.g., ViT [10], Swin [25]), advertising recommendation (e.g., M6 [24]), and so on. These models are also known as foundation models, since they are trained on hundreds of gigabytes of data and can be adapted, e.g., via task-specific fine-tuning, to a wide range of downstream tasks.