-
Notifications
You must be signed in to change notification settings - Fork 504
Issues: baichuan-inc/Baichuan-7B
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[Question] 增量预训练,损失有点高,这正常吗,还是哪里出问题了?
question
Further information is requested
#148
opened Aug 7, 2024 by
datongzi666
5 tasks done
[Question] 安装依赖时终端报错(deepspeed)
question
Further information is requested
#146
opened May 11, 2024 by
duolaBmeng673
5 tasks done
[Question] 微信群的二维码失效了
question
Further information is requested
#145
opened May 6, 2024 by
yzhao-2023
5 tasks done
[Question]不能安装xformers
question
Further information is requested
#144
opened May 2, 2024 by
Acid-uncoin
4 of 5 tasks
[BUG] 我下载了huggingface上的baichuan7b模型,使用 里面的测试程序测试发现CUDA错误
bug
Something isn't working
#143
opened Apr 18, 2024 by
QIANXUNZDL123
5 tasks done
[Question] 参数合并后有什么要注意的吗? 我将7B参数和微调参数合并之后,加载新模型,显存占用超过了24G,这个跟原始7B所需显存差很多?这会是什么导致的
question
Further information is requested
#142
opened Feb 27, 2024 by
Micla-SHL
5 tasks done
baichuan2和baichaun2-7B这俩仓库有啥区别吗
question
Further information is requested
#141
opened Feb 1, 2024 by
fxb392
5 tasks done
[Question] Baichuan-Text-Embedding can be open for open source or have api to use or pay for use? thanks
question
Further information is requested
#140
opened Jan 4, 2024 by
Yazooliu
5 tasks done
[Question] 我想用 Baichuan-7B来开发中文文本纠错功能,主要是错别字,请问下可行性?
question
Further information is requested
#139
opened Dec 25, 2023 by
suchstar
5 tasks done
想问一下在A800上测试的吞吐量,换算到推理速度的话有多少tokens/s?
question
Further information is requested
#138
opened Nov 21, 2023 by
HJT9328
5 tasks done
[Typo]
question
Further information is requested
#137
opened Oct 24, 2023 by
Chandler-Bing
5 tasks done
[Question] RoPE的实现和论文里不一致
question
Further information is requested
#136
opened Oct 4, 2023 by
zehmaaa
5 tasks done
[Question] 可以提供模型的国内下载源吗
question
Further information is requested
#134
opened Sep 18, 2023 by
liulfy
5 tasks done
[BUG] CUDA Out of Memory when eval model.
bug
Something isn't working
#133
opened Sep 12, 2023 by
Crystalxd
5 tasks done
[Question] DeepSpeed Zero3 save_checkpoint() got empty mode_states files
question
Further information is requested
#132
opened Sep 11, 2023 by
mynewstart
5 tasks done
能提供个类似open_api.py的文件,可以供我们使用接口进行测试吗?
question
Further information is requested
#131
opened Sep 11, 2023 by
mawenju203
5 tasks done
[Question] 请问7B没有用上FlashAttention吗?
question
Further information is requested
#130
opened Sep 7, 2023 by
nezhazheng
5 tasks done
[Evaluation] 提供 Baichuan 模型在 OpenCompass 上的评测结果
question
Further information is requested
#128
opened Sep 6, 2023 by
Leymore
4 of 5 tasks
[Question] Baichuan-7B多GPU 原生部署、 int8 和 int4 量化部署
question
Further information is requested
#127
opened Aug 29, 2023 by
potong
5 tasks done
[Question] 关于数据处理的疑问
question
Further information is requested
#124
opened Aug 22, 2023 by
mynewstart
5 tasks done
我要做预训练通用模型,样本数据加载这里可以给个demo数据?
question
Further information is requested
#121
opened Aug 16, 2023 by
wangweihua11
5 tasks done
请问想接上下句古诗 需要怎么写提示词?
question
Further information is requested
#120
opened Aug 15, 2023 by
goog
5 tasks done
pretrain learning rate is le-8?
question
Further information is requested
#119
opened Aug 12, 2023 by
hegang1-tal
5 tasks done
请问部署后,如何通过API调用?
question
Further information is requested
#118
opened Aug 12, 2023 by
lemon-simple
5 tasks done
[Question] 你好,训练分词模型的代码可以分享吗?或者有什么参考吗?
question
Further information is requested
#117
opened Aug 12, 2023 by
StarrySeas1
5 tasks done
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.