-
Notifications
You must be signed in to change notification settings - Fork 0
Issues: AkihikoWatanabe/paper_notes
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
DeepNet: Scaling Transformers to 1,000 Layers, Hongyu Wang+, arXiv'22
Architecture
NLP
Normalization
Transformer
#1900
opened Apr 19, 2025 by
AkihikoWatanabe
Foundation Transformers, Hongyu Wang+, arXiv'22
Architecture
ComputerVision
MulltiModal
NLP
Normalization
Pocket
SpeechProcessing
Transformer
#1899
opened Apr 19, 2025 by
AkihikoWatanabe
Introducing UI-TARS-1.5, ByteDance, 2025.04
Article
ComputerVision
LLMAgent
MulltiModal
NLP
OpenWeightLLM
Pocket
Reasoning
translation_required
x-Use
#1896
opened Apr 18, 2025 by
AkihikoWatanabe
研究者向けの技術研修資料を公開します, CyberAgent, 2025.04
Article
Tutorial
#1895
opened Apr 18, 2025 by
AkihikoWatanabe
あえて予測の更新頻度を落とす| サプライチェーンの現場目線にたった機械学習の導入, モノタロウ Tech Blog, 2022.03
Article
MachineLearning
#1894
opened Apr 18, 2025 by
AkihikoWatanabe
Learning Dynamics of LLM Finetuning, Yi Ren+, ICLR'25
Analysis
ICLR
LanguageModel
MachineLearning
NLP
Pocket
Supervised-FineTuning (SFT)
#1892
opened Apr 18, 2025 by
AkihikoWatanabe
Non-Determinism of "Deterministic" LLM Settings, Berk Atil+, arXiv'24
Decoding
Evaluation
LanguageModel
NLP
Pocket
#1890
opened Apr 14, 2025 by
AkihikoWatanabe
The Curious Case of Neural Text Degeneration, Ari Holtzman+, ICLR'20
Decoding
ICLR
LanguageModel
NLP
Pocket
#1889
opened Apr 14, 2025 by
AkihikoWatanabe
Seed-Thinking-v1.5, ByteDance, 2025.04
LanguageModel
NLP
OpenWeightLLM
Reasoning
#1886
opened Apr 12, 2025 by
AkihikoWatanabe
Segment Anything, Alexander Kirillov+, arXiv'23
ComputerVision
FoundationModel
ImageSegmentation
Pocket
Transformer
#1885
opened Apr 11, 2025 by
AkihikoWatanabe
Large Vision Language Model (LVLM) に関する最新知見まとめ (Part 1), Daiki Shiono, 2024.11
ComputerVision
LanguageModel
Survey
#1880
opened Apr 11, 2025 by
AkihikoWatanabe
Fiction.liveBench, 2025.04
Dataset
Evaluation
LanguageModel
LongSequence
NLP
#1877
opened Apr 9, 2025 by
AkihikoWatanabe
BFCLv2, UC Berkeley, 2024.08
API
Dataset
LanguageModel
NLP
Tools
#1875
opened Apr 8, 2025 by
AkihikoWatanabe
Previous Next
ProTip!
Find all open issues with in progress development work with linked:pr.