The offense-defense imbalance you mention is the most worrying aspect here. Even though Anthropic could see the attacks happening on their infrastructure, they still couldn't prevent them completely. When open-weight models catch up in capability, this detection window disappears entirely.
Awesome! That is immensely useful and informative.
thank you
Thank you for this information!