Skip to content

Issues: baichuan-inc/Baichuan-7B

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

[Question] 增量预训练,损失有点高,这正常吗,还是哪里出问题了? question Further information is requested
#148 opened Aug 7, 2024 by datongzi666
5 tasks done
[Question] 安装依赖时终端报错(deepspeed) question Further information is requested
#146 opened May 11, 2024 by duolaBmeng673
5 tasks done
[Question] 微信群的二维码失效了 question Further information is requested
#145 opened May 6, 2024 by yzhao-2023
5 tasks done
[Question]不能安装xformers question Further information is requested
#144 opened May 2, 2024 by Acid-uncoin
4 of 5 tasks
baichuan2和baichaun2-7B这俩仓库有啥区别吗 question Further information is requested
#141 opened Feb 1, 2024 by fxb392
5 tasks done
想问一下在A800上测试的吞吐量,换算到推理速度的话有多少tokens/s? question Further information is requested
#138 opened Nov 21, 2023 by HJT9328
5 tasks done
[Typo] question Further information is requested
#137 opened Oct 24, 2023 by Chandler-Bing
5 tasks done
[Question] RoPE的实现和论文里不一致 question Further information is requested
#136 opened Oct 4, 2023 by zehmaaa
5 tasks done
[Question] 可以提供模型的国内下载源吗 question Further information is requested
#134 opened Sep 18, 2023 by liulfy
5 tasks done
[BUG] CUDA Out of Memory when eval model. bug Something isn't working
#133 opened Sep 12, 2023 by Crystalxd
5 tasks done
[Question] DeepSpeed Zero3 save_checkpoint() got empty mode_states files question Further information is requested
#132 opened Sep 11, 2023 by mynewstart
5 tasks done
能提供个类似open_api.py的文件,可以供我们使用接口进行测试吗? question Further information is requested
#131 opened Sep 11, 2023 by mawenju203
5 tasks done
[Question] 请问7B没有用上FlashAttention吗? question Further information is requested
#130 opened Sep 7, 2023 by nezhazheng
5 tasks done
[Evaluation] 提供 Baichuan 模型在 OpenCompass 上的评测结果 question Further information is requested
#128 opened Sep 6, 2023 by Leymore
4 of 5 tasks
[Question] Baichuan-7B多GPU 原生部署、 int8 和 int4 量化部署 question Further information is requested
#127 opened Aug 29, 2023 by potong
5 tasks done
[Question] 关于数据处理的疑问 question Further information is requested
#124 opened Aug 22, 2023 by mynewstart
5 tasks done
我要做预训练通用模型,样本数据加载这里可以给个demo数据? question Further information is requested
#121 opened Aug 16, 2023 by wangweihua11
5 tasks done
请问想接上下句古诗 需要怎么写提示词? question Further information is requested
#120 opened Aug 15, 2023 by goog
5 tasks done
pretrain learning rate is le-8? question Further information is requested
#119 opened Aug 12, 2023 by hegang1-tal
5 tasks done
请问部署后,如何通过API调用? question Further information is requested
#118 opened Aug 12, 2023 by lemon-simple
5 tasks done
[Question] 你好,训练分词模型的代码可以分享吗?或者有什么参考吗? question Further information is requested
#117 opened Aug 12, 2023 by StarrySeas1
5 tasks done
ProTip! Mix and match filters to narrow down what you’re looking for.