Coin Digest Daily

OpenAI GPT-4o ranked as best AI model for writing Solidity smart contract code by IQ

21 October 2024
in Web3

SolidityBench by IQ has launched as the first leaderboard to evaluate LLMs in Solidity code generation. Available on Hugging Face, it introduces two new benchmarks, NaïveJudge and HumanEval for Solidity, designed to assess and rank the proficiency of AI models in generating smart contract code.

Developed by IQ’s BrainDAO as part of its forthcoming IQ Code suite, SolidityBench serves to refine IQ’s own EVMind LLMs and compare them against generalist and community-created models. IQ Code aims to offer AI models tailored for generating and auditing smart contract code, addressing the growing need for secure and efficient blockchain applications.

As IQ told CryptoSlate, NaïveJudge offers a novel approach by tasking LLMs with implementing smart contracts based on detailed specifications derived from audited OpenZeppelin contracts. These contracts provide a gold standard for correctness and efficiency. The generated code is evaluated against a reference implementation using criteria such as functional completeness, adherence to Solidity best practices and security standards, and optimization efficiency.

The evaluation process uses advanced LLMs, including different versions of OpenAI’s GPT-4 and Claude 3.5 Sonnet, as impartial code reviewers. They assess the code against rigorous criteria, including implementation of all key functionality, handling of edge cases, error management, proper syntax usage, and overall code structure and maintainability.

Optimization considerations such as gas efficiency and storage management are also evaluated. Scores range from 0 to 100, providing a comprehensive assessment across functionality, security, and efficiency, mirroring the complexities of professional smart contract development.

Which AI models are best for Solidity smart contract development?

Benchmarking results showed that OpenAI’s GPT-4o model achieved the highest overall score of 80.05, with a NaïveJudge score of 72.18 and HumanEval for Solidity pass rates of 80% at pass@1 and 92% at pass@3.

Interestingly, newer reasoning models like OpenAI’s o1-preview and o1-mini were beaten to the top spot, scoring 77.61 and 75.08, respectively. Models from Anthropic and xAI, including Claude 3.5 Sonnet and grok-2, demonstrated competitive performance with overall scores hovering around 74. Nvidia’s Llama-3.1-Nemotron-70B scored lowest in the top 10 at 52.54.

SolidityBench scores for LLMs (Hugging Face)

Per IQ, HumanEval for Solidity adapts OpenAI’s original HumanEval benchmark from Python to Solidity, encompassing 25 tasks of varying difficulty. Each task includes corresponding tests compatible with Hardhat, a popular Ethereum development environment, facilitating accurate compilation and testing of generated code. The evaluation metrics, pass@1 and pass@3, measure the model’s success on initial attempts and over multiple tries, offering insights into both precision and problem-solving capabilities.
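
To make the pass@1 and pass@3 metrics concrete, here is a minimal Python sketch. IQ has not described how SolidityBench computes these numbers, so this assumes the standard unbiased pass@k estimator used with the original HumanEval benchmark. The function names and the sample results are hypothetical and are not SolidityBench data.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k for one task from n generated samples, c of which passed.

    Assumption: the standard unbiased estimator 1 - C(n-c, k) / C(n, k).
    """
    if k > n:
        raise ValueError("k cannot exceed the number of samples n")
    if n - c < k:
        # Fewer than k failures exist, so any k samples include a pass.
        return 1.0
    return 1.0 - comb(n - c, k) / comb(n, k)


def benchmark_pass_at_k(results: list[list[bool]], k: int) -> float:
    """Average pass@k over tasks; results[i] holds pass/fail per sample for task i."""
    scores = [pass_at_k(len(r), sum(r), k) for r in results]
    return sum(scores) / len(scores)


# Hypothetical outcomes: 4 tasks, 3 generated contracts each,
# True meaning the task's test suite passed for that sample.
results = [
    [True, True, True],
    [False, True, False],
    [False, False, False],
    [False, False, True],
]

print(f"pass@1 = {benchmark_pass_at_k(results, 1):.4f}")
print(f"pass@3 = {benchmark_pass_at_k(results, 3):.4f}")
```

In this made-up data only one task passes on every sample, so pass@1 is low, while three of the four tasks pass at least once in three tries, so pass@3 is higher. The gap between the two figures is what the benchmark uses to separate first-try precision from success over repeated attempts.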

Goals of using AI models in smart contract development

By introducing these benchmarks, SolidityBench seeks to advance AI-assisted smart contract development. It encourages the creation of more sophisticated and reliable AI models while providing developers and researchers with valuable insights into AI’s current capabilities and limitations in Solidity development.

The benchmarking toolkit aims to advance IQ Code’s EVMind LLMs and also to set new standards for AI-assisted smart contract development across the blockchain ecosystem. The initiative hopes to address a critical need in the industry, where demand for secure and efficient smart contracts continues to grow.

Developers, researchers, and AI enthusiasts are invited to explore and contribute to SolidityBench, which aims to drive the continuous refinement of AI models, promote best practices, and advance decentralized applications.

Visit the SolidityBench leaderboard on Hugging Face to learn more and begin benchmarking Solidity generation models.

Tags: Code, contract, GPT, Model, OpenAI, ranked, smart, Solidity, Writing
Copyright © 2024 Coin Digest Daily.
Coin Digest Daily is not responsible for the content of external sites.
