History

caojiewen da60f433f1 removed the useless link of apply form		4 years ago
..
scripts	add ldp_linucb into model zoo	5 years ago

src	add ldp_linucb into model zoo	5 years ago

README.md	removed the useless link of apply form	4 years ago

result1.png	add ldp_linucb into model zoo	5 years ago

result2.png	add ldp_linucb into model zoo	5 years ago

train_eval.py	add ldp_linucb into model zoo	5 years ago

LDP LinUCB

Locally Differentially Private (LDP) LinUCB is a variant of LinUCB bandit algorithm with local differential privacy guarantee, which can preserve users' personal data with theoretical guarantee.

Paper: Kai Zheng, Tianle Cai, Weiran Huang, Zhenguo Li, Liwei Wang. "Locally Differentially Private (Contextual) Bandits Learning." Advances in Neural Information Processing Systems. 2020.

Model Architecture

The server interacts with users in rounds. For a coming user, the server first transfers the current model parameters to the user. In the user side, the model chooses an action based on the user feature to play (e.g., choose a movie to recommend), and observes a reward (or loss) value from the user (e.g., rating of the movie). Then we perturb the data to be transferred by adding Gaussian noise. Finally, the server receives the perturbed data and updates the model. Details can be found in the original paper.

Dataset

Note that you can run the scripts based on the dataset mentioned in original paper. In the following sections, we will introduce how to run the scripts using the related dataset below.

Dataset used: MovieLens 100K

Dataset size：5MB, 100,000 ratings (1-5) from 943 users on 1682 movies.
Data format：csv/txt files

Environment Requirements

Hardware (Ascend/GPU)
- Prepare hardware environment with Ascend or GPU processor.
Framework
- MindSpore
For more information, please check the resources below：
- MindSpore Tutorials
- MindSpore Python API

Script Description

Script and Sample Code

├── model_zoo
    ├── README.md                                // descriptions about all the models
    ├── research
        ├── rl
            ├── ldp_linucb
                ├── README.md                    // descriptions about LDP LinUCB
                ├── scripts
                │   ├── run_train_eval.sh        // shell script for running on Ascend
                ├── src
                │   ├── dataset.py               // dataset for movielens
                │   ├── linucb.py                // model
                ├── train_eval.py                // training script
                ├── result1.png                  // experimental result
                ├── result2.png                  // experimental result

Script Parameters

Parameters for preparing MovieLens 100K dataset

'num_actions': 20         # number of candidate movies to be recommended
'rank_k': 20              # rank of rating matrix completion

Parameters for LDP LinUCB, MovieLens 100K dataset

'epsilon': 8e5            # privacy parameter
'delta': 0.1              # privacy parameter
'alpha': 0.1              # failure probability
'iter_num': 1e6           # number of iterations

Launch

running on Ascend

python train_eval.py > result.log 2>&1 &

The python command above will run in the background, you can view the results through the file result.log.

The regret value will be achieved as follows:

--> Step: 0, diff: 348.662, current_regret: 0.000, cumulative regret: 0.000
--> Step: 1, diff: 338.457, current_regret: 0.000, cumulative regret: 0.000
--> Step: 2, diff: 336.465, current_regret: 2.000, cumulative regret: 2.000
--> Step: 3, diff: 327.337, current_regret: 0.000, cumulative regret: 2.000
--> Step: 4, diff: 325.039, current_regret: 2.000, cumulative regret: 4.000
...

Model Description

The original paper assumes that the norm of user features is bounded by 1 and the norm of rating scores is bounded by 2. For the MovieLens dataset, we normalize rating scores to [-1,1]. Thus, we set sigma in Algorithm 5 to be $$4/epsilon * sqrt(2 * ln(1.25/delta))$$.

Performance

The performance for different privacy parameters:

x: number of iterations
y: cumulative regret

The performance compared with optimal non-private regret O(sqrt(T)):

x: number of iterations
y: cumulative regret divided by sqrt(T)

Description of Random Situation

In train_eval.py, we randomly sample a user at each round. We also add Gaussian noise to the date being transferred.

ModelZoo Homepage

Please check the official
homepage.

MindSpore is a new open source deep learning training/inference framework that could be used for mobile, edge and cloud scenarios.

C++ Python Text Unity3D Asset C other

314202276@qq.com 5518576+mindspore_ci@user.noreply.gitee.com tommylike@qq.com zhaozhenlong1@huawei.com zhoufeng54@huawei.com sunsuodong@huawei.com wangkaisheng2@huawei.com shiliang10@huawei.com yangruoqi@huawei.com xiefangqi2@huawei.com caifubi1@huawei.com lingqiaomin.huawei.com fuzhiye@huawei.com chenweifeng720@huawei.com liubuyu1@huawei.com changzherui1@huawei.com guozhijian@huawei.com huanghui44@huawei.com peixu.ren1@huawei.com liuxiao93@huawei.com zhaoting23@huawei.com lizhenyu13@huawei.com yaoyifan1@huawei.com xuanyue@huawei.com yuchaojie1@huawei.com

README.md

Contents

Contributors (25+) All

Contributors (25+)
All