DeFi Daily News
Monday, November 3, 2025
Advertisement
  • Cryptocurrency
    • Bitcoin
    • Ethereum
    • Altcoins
    • DeFi-IRA
  • DeFi
    • NFT
    • Metaverse
    • Web 3
  • Finance
    • Business Finance
    • Personal Finance
  • Markets
    • Crypto Market
    • Stock Market
    • Analysis
  • Other News
    • World & US
    • Politics
    • Entertainment
    • Tech
    • Sports
    • Health
  • Videos
No Result
View All Result
DeFi Daily News
  • Cryptocurrency
    • Bitcoin
    • Ethereum
    • Altcoins
    • DeFi-IRA
  • DeFi
    • NFT
    • Metaverse
    • Web 3
  • Finance
    • Business Finance
    • Personal Finance
  • Markets
    • Crypto Market
    • Stock Market
    • Analysis
  • Other News
    • World & US
    • Politics
    • Entertainment
    • Tech
    • Sports
    • Health
  • Videos
No Result
View All Result
DeFi Daily News
No Result
View All Result
Home DeFi Web 3

IQ ranking declares OpenAI GPT 4o as the top AI model for generating Solidity smart contract code

Liam 'Akiba' Wright by Liam 'Akiba' Wright
October 21, 2024
in Web 3
0 0
0
IQ ranking declares OpenAI GPT 4o as the top AI model for generating Solidity smart contract code
0
SHARES
0
VIEWS
Share on FacebookShare on TwitterShare on Telegram
Listen to this article

Receive, Manage & Grow Your Crypto Investments With Brighty

SolidityBench by IQ has launched as the first leaderboard to evaluate LLMs in Solidity code generation. Available on Hugging Face, it introduces two innovative benchmarks, NaïveJudge and HumanEval for Solidity, designed to assess and rank the proficiency of AI models in generating smart contract code.

Developed by IQ’s BrainDAO as part of its forthcoming IQ Code suite, SolidityBench serves to refine their own EVMind LLMs and compare them against generalist and community-created models. IQ Code aims to offer AI models tailored for generating and auditing smart contract code, addressing the growing need for secure and efficient blockchain applications.

As IQ told CryptoSlate, NaïveJudge offers a novel approach by tasking LLMs with implementing smart contracts based on detailed specifications derived from audited OpenZeppelin contracts. These contracts provide a gold standard for correctness and efficiency. The generated code is evaluated against a reference implementation using criteria such as functional completeness, adherence to Solidity best practices, security standards, and optimization efficiency.

The evaluation process leverages advanced LLMs, including different versions of OpenAI’s GPT-4 and Claude 3.5 Sonnet as impartial code reviewers. They assess the code based on rigorous criteria, including implementing all key functionalities, handling edge cases, error management, proper syntax usage, and overall code structure and maintainability.

Optimization considerations such as gas efficiency and storage management are also evaluated. Scores range from 0 to 100, providing a comprehensive assessment across functionality, security, and efficiency, mirroring the complexities of professional smart contract development.

Which AI models are best for solidity smart contract development?

Benchmarking results showed that OpenAI’s GPT-4o model achieved the highest overall score of 80.05, with a NaïveJudge score of 72.18 and HumanEval for Solidity pass rates of 80% at pass@1 and 92% at pass@3.

Interestingly, newer reasoning models like OpenAI’s o1-preview and o1-mini were beaten to the top spot, scoring 77.61 and 75.08, respectively. Models from Anthropic and XAI, including Claude 3.5 Sonnet and grok-2, demonstrated competitive performance with overall scores hovering around 74. Nvidia’s Llama-3.1-Nemotron-70B scored lowest in the top 10 at 52.54.

SolidityBench scores for LLMs (Hugging Face)
SolidityBench scores for LLMs (Hugging Face)

Per IQ, HumanEval for Solidity adapts OpenAI’s original HumanEval benchmark from Python to Solidity, encompassing 25 tasks of varying difficulty. Each task includes corresponding tests compatible with Hardhat, a popular Ethereum development environment, facilitating accurate compilation and testing of generated code. The evaluation metrics, pass@1 and pass@3, measure the model’s success on initial attempts and over multiple tries, offering insights into both precision and problem-solving capabilities.

Goals of utilizing AI models in smart contract development

By introducing these benchmarks, SolidityBench seeks to advance AI-assisted smart contract development. It encourages the creation of more sophisticated and reliable AI models while providing developers and researchers with valuable insights into AI’s current capabilities and limitations in Solidity development.

The benchmarking toolkit aims to advance IQ Code’s EVMind LLMs and also sets new standards for AI-assisted smart contract development across the blockchain ecosystem. The initiative hopes to address a critical need in the industry, where the demand for secure and efficient smart contracts continues to grow.

Developers, researchers, and AI enthusiasts are invited to explore and contribute to SolidityBench, which aims to drive the continuous refinement of AI models, promote best practices, and advance decentralized applications.

Visit the SolidityBench leaderboard on Hugging Face to learn more and begin benchmarking Solidity generation models.

🤖 Top AI Crypto Assets

View AllMentioned in this article

And that concludes our deep dive into the world of AI models for Solidity smart contract development. The benchmarking results, innovative approaches, and the potential for advancing AI-assisted development are exciting prospects for the blockchain industry. As we look towards the future, the collaboration between AI and smart contracts promises even more secure, efficient, and sophisticated applications. Stay tuned for more exciting developments in the DeFi space on DeFi Daily News!



Source link

Tags: CodeContractdeclaresGeneratingGPTModelOpenAIRankingSmartSolidityTop
ShareTweetShare
Previous Post

Google’s AI Podcast Creator Takes the Internet by Storm: Welcome to a New Chapter in Content Creation – Metaverseplanet.net

Next Post

OpenAI GPT-4o named top AI model for crafting Solidity smart contract code by IQ | Coin Media

Next Post
OpenAI GPT-4o named top AI model for crafting Solidity smart contract code by IQ | Coin Media

OpenAI GPT-4o named top AI model for crafting Solidity smart contract code by IQ | Coin Media

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Search

No Result
View All Result
  • Trending
  • Comments
  • Latest
rewrite this title Ripple News: First U.S. Spot XRP ETF Surpasses 0 Million in Assets

rewrite this title Ripple News: First U.S. Spot XRP ETF Surpasses $100 Million in Assets

October 26, 2025
rewrite this title and make it good for SEO MEXC Vs KuCoin 2025: Which Exchange Is Better?

rewrite this title and make it good for SEO MEXC Vs KuCoin 2025: Which Exchange Is Better?

October 26, 2025
Why Outlet Malls Are Struggling In The U.S.

Why Outlet Malls Are Struggling In The U.S.

July 16, 2024
MAGA-Themed Cryptocurrency Surges as Donald Trump’s Presidential Election Odds Increase on Polymarket – The Daily Hodl

MAGA-Themed Cryptocurrency Surges as Donald Trump’s Presidential Election Odds Increase on Polymarket – The Daily Hodl

July 15, 2024
Living Paycheck-to-Paycheck After a Breakup (K Car Debt)

Living Paycheck-to-Paycheck After a Breakup ($52K Car Debt)

July 5, 2024
Driving Innovation: NFTs and the Tech Industry

Driving Innovation: NFTs and the Tech Industry

September 20, 2024
rewrite this title “You kept playing him until there was criticism” – Aakash Chopra questions India’s tactics in AUS vs IND 2025 3rd T20I

rewrite this title “You kept playing him until there was criticism” – Aakash Chopra questions India’s tactics in AUS vs IND 2025 3rd T20I

November 3, 2025
rewrite this title TubeBuddy AI Review – The Smart YouTube Optimization Suite Built for Serious Growth

rewrite this title TubeBuddy AI Review – The Smart YouTube Optimization Suite Built for Serious Growth

November 3, 2025
rewrite this title Bitcoin Hyper Presale Rockets Past .6M — Could It Be Crypto’s Next Breakout Star?

rewrite this title Bitcoin Hyper Presale Rockets Past $25.6M — Could It Be Crypto’s Next Breakout Star?

November 3, 2025
rewrite this title Trump Downplays Knowledge of Binance Chief After Pardon Linked to Family’s Crypto Dealings – Decrypt

rewrite this title Trump Downplays Knowledge of Binance Chief After Pardon Linked to Family’s Crypto Dealings – Decrypt

November 3, 2025
rewrite this title with good SEO XRP Price Stays Weak — Bearish Outlook Intact Under .60 Resistance

rewrite this title with good SEO XRP Price Stays Weak — Bearish Outlook Intact Under $2.60 Resistance

November 3, 2025
rewrite this title China’s Baidu says weekly robotaxi rides hit 250,000 — same as Alphabet’s Waymo this spring

rewrite this title China’s Baidu says weekly robotaxi rides hit 250,000 — same as Alphabet’s Waymo this spring

November 2, 2025
DeFi Daily

Stay updated with DeFi Daily, your trusted source for the latest news, insights, and analysis in finance and cryptocurrency. Explore breaking news, expert analysis, market data, and educational resources to navigate the world of decentralized finance.

  • About Us
  • Blogs
  • DeFi-IRA | Learn More.
  • Advertise with Us
  • Disclaimer
  • Privacy Policy
  • DMCA
  • Cookie Privacy Policy
  • Terms and Conditions
  • Contact us

Copyright © 2024 Defi Daily.
Defi Daily is not responsible for the content of external sites.

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result
  • Cryptocurrency
    • Bitcoin
    • Ethereum
    • Altcoins
    • DeFi-IRA
  • DeFi
    • NFT
    • Metaverse
    • Web 3
  • Finance
    • Business Finance
    • Personal Finance
  • Markets
    • Crypto Market
    • Stock Market
    • Analysis
  • Other News
    • World & US
    • Politics
    • Entertainment
    • Tech
    • Sports
    • Health
  • Videos

Copyright © 2024 Defi Daily.
Defi Daily is not responsible for the content of external sites.