Nature Search

Defending ChatGPT against jailbreak attack via self-reminders

Interest in using large language models such as ChatGPT has grown rapidly, but concerns about safe and responsible use have emerged, in part because adversarial prompts can bypass existing safeguards with so-called jailbreak attacks. Wu et al. build a dataset of various types of jailbreak attack prompt and demonstrate a simple but effective technique to counter these attacks by encapsulating users’ prompts in another standard prompt that reminds ChatGPT to respond responsibly.

Yueqi Xie
Jingwei Yi
Fangzhao Wu
Research12 Dec 2023
Nature Machine Intelligence

Volume: 5, P: 1486-1496
Differentially private knowledge transfer for federated learning

To ensure the privacy of processed data, federated learning approaches involve local differential privacy techniques which however require communicating a large amount of data that needs protection. The authors propose here a framework that uses selected small data to transfer knowledge in federated learning with privacy guarantees.

Tao Qi
Fangzhao Wu
Xing Xie
ResearchOpen Access24 Jun 2023
Nature Communications

Volume: 14, P: 1-9
A federated graph neural network framework for privacy-preserving personalization

Mainstream personalization methods rely on centralized Graph Neural Network learning on global graphs, which have considerable privacy risks due to the privacy-sensitive nature of user data. Here, the authors present a federated GNN framework for both effective and privacy-preserving personalization.

Chuhan Wu
Fangzhao Wu
Xing Xie
ResearchOpen Access02 Jun 2022
Nature Communications

Volume: 13, P: 1-10
Communication-efficient federated learning via knowledge distillation

This work presents a communication-efficient federated learning method that saves a major fraction of communication cost. It reveals the advantage of reciprocal learning in machine knowledge transfer and the evolutional low-rank properties of deep model updates.

Chuhan Wu
Fangzhao Wu
Xing Xie
ResearchOpen Access19 Apr 2022
Nature Communications

Volume: 13, P: 1-8
Selective knowledge sharing for privacy-preserving federated distillation without a good teacher

While federated learning is promising for efficient collaborative learning without revealing local data, it remains vulnerable to white-box privacy attacks, suffers from high communication overhead, and struggles to adapt to heterogeneous models. Here, the authors show a federated distillation method to tackle these challenges, which leverages the strengths of knowledge distillation in a federated learning setting.

Jiawei Shao
Fangzhao Wu
Jun Zhang
ResearchOpen Access08 Jan 2024
Nature Communications

Volume: 15, P: 1-11
Removing AI’s sentiment manipulation of personalized news delivery

Chuhan Wu
Fangzhao Wu
Yongfeng Huang
ResearchOpen Access20 Dec 2022
Humanities and Social Sciences Communications

Volume: 9, P: 1-9

Search

Filter By:

Defending ChatGPT against jailbreak attack via self-reminders

Differentially private knowledge transfer for federated learning

A federated graph neural network framework for privacy-preserving personalization

Communication-efficient federated learning via knowledge distillation

Selective knowledge sharing for privacy-preserving federated distillation without a good teacher

Removing AI’s sentiment manipulation of personalized news delivery

Search

Quick links