Evaluating the performance of large language models for design validation

Publication Type
Conference Paper
Date Issued
2024-09
Language
English
Author(s)
Rahman, Abdur
Fey, Görschwin
Eingebettete Systeme E-13  
TORE-URI
https://hdl.handle.net/11420/52445
Citation
37th IEEE International System-on-Chip Conference, SOCC 2024
Contribution to Conference
37th IEEE International System-on-Chip Conference, SOCC 2024  
Publisher DOI
10.1109/SOCC62300.2024.10737717
Scopus ID
2-s2.0-85210574733
Publisher
IEEE
ISBN
979-8-3503-7756-9
979-8-3503-7757-6
Abstract
With the emergence of Large Language Models (LLMs), there has been growing interest in applying them beyond traditional natural language processing tasks. One such application is hardware design validation. This paper presents a comprehensive evaluation of LLMs in design validation tasks. In design validation, it is essential to analyze the designs and craft appropriate testbenches for them. We evaluate the ability of LLMs to recognize hardware descriptions as well as their ability to generate testbenches for those designs. We present an evaluation methodology and benchmarks for these tasks. Experiments were conducted with four prominent LLMs and designs ranging from small arithmetic blocks up to a small MIPS CPU. The results demonstrate promising performance up to a limited complexity threshold.
Subjects
design | large language model | validation
DDC Class
600: Technology