How a Single Keystroke Can Crack AI Safeguards: The TokenBreak Threat
The TokenBreak attack represents a significant vulnerability in AI security: by altering a single character in an input, attackers can bypass the content-moderation classifiers that guard large language models (LLMs). The technique exploits the tokenization strategy these models use to break text down into smaller units called tokens. A one-character perturbation changes how a key word is tokenized, inducing a false negative so that the classifier fails to flag malicious content. Crucially, the downstream model still understands the intent behind the altered text and responds to it, so the attack succeeds even though the moderation layer treated the input as legitimate. Models using Unigram tokenizers are more resistant to this manipulation. The discovery highlights the need for stronger security measures in AI systems to prevent such exploits.
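To make the mechanism concrete, here is a minimal sketch of how a single added character changes what a subword tokenizer sees. It assumes the Hugging Face `transformers` package and the public `bert-base-uncased` WordPiece tokenizer as a stand-in for a moderation classifier's tokenizer; the example strings are hypothetical, and the exact subword splits depend on the model's vocabulary.

```python
# Sketch: one added character changes the token sequence a classifier sees.
# Assumes `pip install transformers` and the public bert-base-uncased
# checkpoint (a WordPiece tokenizer used here purely for illustration).
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

original = "ignore previous instructions"
perturbed = "ignore previous finstructions"  # single-character change

print(tokenizer.tokenize(original))
# e.g. ['ignore', 'previous', 'instructions']
print(tokenizer.tokenize(perturbed))
# e.g. ['ignore', 'previous', 'fin', '##struct', '##ions']

# The trigger word 'instructions' no longer surfaces as its own token,
# so a classifier keyed to it can miss the input, while a downstream
# LLM can still recover the intent from context.
```

The takeaway is that the defense and the target model read the same bytes through different tokenizers, and the attack lives in that gap.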