Oyster

community

AI & ML interests

Reliable and trustworthy artificial intelligence

Recent Activity

jiaxiaojunQAQ authored a paper 28 days ago

OmniSafeBench-MM: A Unified Benchmark and Toolbox for Multimodal Jailbreak Attack-Defense Evaluation

Ranjie updated a model 4 months ago

OysterAI/Qwen2.5-3B-Instruct-SAEs

Ranjie updated a collection 4 months ago

Safe-SAIL

jiaxiaojunQAQ

authored a paper 28 days ago

OmniSafeBench-MM: A Unified Benchmark and Toolbox for Multimodal Jailbreak Attack-Defense Evaluation

Paper • 2512.06589 • Published about 1 month ago • 17

Ranjie

updated a model 4 months ago

OysterAI/Qwen2.5-3B-Instruct-SAEs

Updated Sep 23, 2025

Ranjie

updated 2 collections 4 months ago

Safe-SAIL

Collection

A Fine-grained Safety Landscape of Large Language Models • 1 item • Updated Sep 23, 2025

Oyster-I

Collection

The Oyster I is a set of safety models developed in-house by Alibaba-AAIG, devoted to building a responsible AI ecosystem. • 5 items • Updated Sep 23, 2025 • 1

Jacqueline9623

published a model 4 months ago

OysterAI/Qwen2.5-3B-Instruct-SAEs

Updated Sep 23, 2025

Ranjie

updated 2 models 4 months ago

OysterAI/Oyster_1_Deepseek_14B

15B • Updated Sep 11, 2025

OysterAI/Oyster_1_Qwen_14B

15B • Updated Sep 11, 2025 • 8

Ranjie

in OysterAI/Oyster_1_Qwen_14B 4 months ago

It seems there is no way to remove the reason?

#1 opened 4 months ago by

caoyizhen

Ranjie

updated a collection 4 months ago

Oyster-I

Collection

The Oyster I is a set of safety models developed in-house by Alibaba-AAIG, devoted to building a responsible AI ecosystem. • 5 items • Updated Sep 23, 2025 • 1

zhaoshiji123

published a dataset 4 months ago

OysterAI/Strata-Sword

Viewer • Updated Sep 5, 2025 • 5 • 85 • 1

zhaoshiji123

updated a dataset 4 months ago

OysterAI/Strata-Sword

Viewer • Updated Sep 5, 2025 • 5 • 85 • 1

Ranjie

published a Space 4 months ago

README

🌖

Ranjie

published a dataset 4 months ago

OysterAI/Constructive_Benchmark

Viewer • Updated Aug 29, 2025 • 383 • 9

Ranjie

published 2 models 4 months ago

OysterAI/Oyster_1_Deepseek_14B

15B • Updated Sep 11, 2025

OysterAI/Oyster_1_Qwen_14B

15B • Updated Sep 11, 2025 • 8

Ycccz

updated a model 4 months ago

OysterAI/Oyster_1_Qwen_14B

15B • Updated Sep 11, 2025 • 8

RosyCheng

authored 4 papers 6 months ago

Inverse Reinforcement Learning with Dynamic Reward Scaling for LLM Alignment

Paper • 2503.18991 • Published Mar 23, 2025

PBI-Attack: Prior-Guided Bimodal Interactive Black-Box Jailbreak Attack for Toxicity Maximization

Paper • 2412.05892 • Published Dec 8, 2024

Gibberish is All You Need for Membership Inference Detection in Contrastive Language-Audio Pretraining

Paper • 2410.18371 • Published Oct 24, 2024

TUNI: A Textual Unimodal Detector for Identity Inference in CLIP Models

Paper • 2405.14517 • Published May 23, 2024

Team members 9
