Repository logo
  • Log In
    Log in via Symplectic to deposit your publication(s).
Repository logo
  • Communities & Collections
  • Research Outputs
  • Statistics
  • Log In
    Log in via Symplectic to deposit your publication(s).
  1. Home
  2. Faculty of Engineering
  3. Electrical and Electronic Engineering
  4. Electrical and Electronic Engineering
  5. ITERA-LLM: Boosting sub-8-bit Large Language Model inference via iterative tensor decomposition
 
  • Details
ITERA-LLM: Boosting sub-8-bit Large Language Model inference via iterative tensor decomposition
File(s)
2505.08981v1.pdf (2.7 MB)
Preprint
Author(s)
Zheng, Keran
Huang, Yinting
Yu, Zhewen
Bouganis, Christos-Savvas
Type
preprint
Date Issued
2025-05-13
Citation
arXiv, 2025
URI
https://hdl.handle.net/10044/1/120411
URL
https://arxiv.org/abs/2505.08981v1
DOI
https://www.dx.doi.org/10.48550/arXiv.2505.08981
Journal / Book Title
arXiv
Copyright Statement
Copyright © 2025 The Author(s). This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (https://creativecommons.org/licenses/by-nc-sa/4.0/).
License URL
https://creativecommons.org/licenses/by-nc-sa/4.0/
Identifier
http://arxiv.org/abs/2505.08981v1
Subjects
cs.AR
cs.AR
About
Spiral Depositing with Spiral Publishing with Spiral Symplectic
Contact us
Open access team Report an issue
Other Services
Scholarly Communications Library Services
logo

Imperial College London

South Kensington Campus

London SW7 2AZ, UK

tel: +44 (0)20 7589 5111

Accessibility Modern slavery statement Cookie Policy

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Cookie settings
  • Privacy policy
  • End User Agreement
  • Send Feedback