Menu
Sign In Search Podcasts Charts People & Topics Add Podcast API Pricing
Podcast Image

AI: post transformers

DeepSeek Safety Concerns

08 Aug 2025

Description

This research paper focuses on a safety evaluation of DeepSeek-R1 and DeepSeek-V3 models within Chinese language contexts, an area previously underexplored. It highlights that while DeepSeek models possess strong reasoning capabilities, previous studies, primarily in English, have revealed significant safety flaws. To address the gap in Chinese safety assessments, the authors introduce CHiSafetyBench, a new benchmark designed to systematically test these models across various safety categories like discrimination and violation of values. The experimental results quantitatively demonstrate the deficiencies of DeepSeek models in Chinese safety performance, particularly in identifying and refusing harmful content, offering insights for future improvements. The authors acknowledge potential biases in their evaluation and plan to continually optimize the benchmark.Source: https://arxiv.org/pdf/2502.11137

Audio
Featured in this Episode

No persons identified in this episode.

Transcription

This episode hasn't been transcribed yet

Help us prioritize this episode for transcription by upvoting it.

0 upvotes
🗳️ Sign in to Upvote

Popular episodes get transcribed faster

Comments

There are no comments yet.

Please log in to write the first comment.