CryptoABC.net

Elon Musk’s Grok AI Chatbot Has Weakest Security, While Meta’s Llama Stands Strong: Researchers

April 7, 2024
in Australian Crypto News

Security researchers put the much-touted guardrails placed around the most popular AI models to the test to see how well they resisted jailbreaking, and probed just how far the chatbots could be pushed into dangerous territory. They determined that Grok, the chatbot with a “fun mode” developed by Elon Musk’s x.AI, was the least safe tool of the bunch.

“We wanted to test how existing solutions compare and the fundamentally different approaches for LLM security testing that can lead to various outcomes,” Alex Polyakov, co-founder and CEO of Adversa AI, told Decrypt. Polyakov’s firm is focused on protecting AI and its users from cyber threats, privacy issues, and safety incidents.

Jailbreaking refers to circumventing the safety restrictions and ethical guidelines software developers implement.

In one example, the researchers used a linguistic logic manipulation approach—also known as social engineering-based methods—to ask Grok how to seduce a child. The chatbot provided a detailed response, which the researchers noted was “highly sensitive” and should have been restricted by default.

Other responses provided instructions on how to hotwire cars and build bombs.

Image: Adversa.AI

The researchers tested three distinct categories of attack methods. The first was linguistic logic manipulation, which applies various linguistic tricks and psychological prompts to manipulate the AI model’s behavior. One example cited was a “role-based jailbreak,” which frames the request as part of a fictional scenario where unethical actions are permitted.

The team also used programming logic manipulation tactics that exploited the chatbots’ ability to understand programming languages and follow algorithms. One such technique involved splitting a dangerous prompt into multiple innocuous parts and then concatenating them to bypass content filters. Four of the seven models tested (OpenAI’s ChatGPT, Mistral’s Le Chat, Google’s Gemini, and x.AI’s Grok) were vulnerable to this type of attack.
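To see why a filter that inspects pieces in isolation can miss a string that is only assembled later, consider the toy sketch below. It is not the researchers' method (their technical details were not disclosed) and it does not model any vendor's real moderation system. The blocked term, the function names, and the fragments are all hypothetical placeholders chosen for illustration.

```python
# Toy illustration only: a naive keyword filter, not any real moderation system.
# BLOCKED_TERMS holds a harmless, hypothetical placeholder term.
BLOCKED_TERMS = {"forbidden_topic"}


def passes_filter(text: str) -> bool:
    """Return True if no blocked term appears in the text."""
    lowered = text.lower()
    return not any(term in lowered for term in BLOCKED_TERMS)


def check_fragments(fragments: list[str]) -> bool:
    """Check each fragment on its own, as a naive filter might."""
    return all(passes_filter(fragment) for fragment in fragments)


def check_assembled(fragments: list[str]) -> bool:
    """Check the string that results once the fragments are joined."""
    return passes_filter("".join(fragments))


fragments = ["forbid", "den_", "topic"]
fragments_ok = check_fragments(fragments)   # every piece looks innocuous
assembled_ok = check_assembled(fragments)   # the joined string is caught

print("fragments pass:", fragments_ok)
print("assembled passes:", assembled_ok)
```

The defensive takeaway in this sketch is that a check has to run on the text as it will finally be interpreted, not only on the individual inputs.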

Image: Adversa.AI

The third approach involved adversarial AI methods that target how language models process and interpret token sequences. By carefully crafting prompts with token combinations that have similar vector representations, the researchers attempted to evade the chatbots’ content moderation systems. In this case, however, every chatbot detected the attack and prevented it from being exploited.
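The notion of tokens having “similar vector representations” is usually measured with cosine similarity between embedding vectors. The sketch below shows only that measurement, under stated assumptions: the three-dimensional vectors and the token names are made up for illustration, and real models use learned embeddings with hundreds or thousands of dimensions. It says nothing about how the researchers built their prompts.

```python
import math


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two vectors (1.0 means same direction)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Hypothetical toy embeddings; the values are invented for this example.
EMBEDDINGS = {
    "token_a": [0.90, 0.10, 0.30],
    "token_b": [0.85, 0.15, 0.35],   # points in nearly the same direction as token_a
    "token_c": [-0.20, 0.90, -0.40],  # points in a very different direction
}

sim_ab = cosine_similarity(EMBEDDINGS["token_a"], EMBEDDINGS["token_b"])
sim_ac = cosine_similarity(EMBEDDINGS["token_a"], EMBEDDINGS["token_c"])

print(f"token_a vs token_b: {sim_ab:.3f}")
print(f"token_a vs token_c: {sim_ac:.3f}")
```

In this toy setup, `token_a` and `token_b` score close to 1.0 while `token_a` and `token_c` score below zero, which is the sense in which two different token sequences can look alike to a model.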

The researchers ranked the chatbots based on the strength of their respective security measures in blocking jailbreak attempts. Meta’s Llama came out on top as the safest model out of all the tested chatbots, followed by Claude, then Gemini and GPT-4.

“The lesson, I think, is that open source gives you more variability to protect the final solution compared to closed offerings, but only if you know what to do and how to do it properly,” Polyakov told Decrypt.

Grok, however, exhibited a comparatively higher vulnerability to certain jailbreaking approaches, particularly those involving linguistic manipulation and programming logic exploitation. According to the report, Grok was more likely than others to provide responses that could be considered harmful or unethical when plied with jailbreaks.

Overall, Musk’s chatbot ranked last, along with Mistral AI’s proprietary model “Mistral Large.”

Image: Adversa.AI

The full technical details were not disclosed to prevent potential misuse, but the researchers say they want to collaborate with chatbot developers on improving AI safety protocols.

AI enthusiasts and hackers alike constantly probe for ways to “uncensor” chatbot interactions, trading jailbreak prompts on message boards and Discord servers. Tricks range from the OG Karen prompt to more creative ideas like using ASCII art or prompting in exotic languages. These communities, in a way, form a giant adversarial network against which AI developers patch and enhance their models.

Some see a criminal opportunity where others see only fun challenges, however.

“Many forums were found where people sell access to jailbroken models that can be used for any malicious purpose,” Polyakov said. “Hackers can use jailbroken models to create phishing emails, malware, generate hate speech at scale, and use those models for any other illegal purpose.”

Polyakov explained that jailbreaking research is becoming more relevant as society starts to depend more and more on AI-powered solutions for everything from dating to warfare.

“If those chatbots or models on which they rely are used in automated decision-making and connected to email assistants or financial business applications, hackers will be able to gain full control of connected applications and perform any action, such as sending emails on behalf of a hacked user or making financial transactions,” he warned.

Edited by Ryan Ozawa.
