Upgrade CUDA from 7.x to 8.0 on Ubuntu

1. Remove the CUDA 7.x installation (x depends on which version you installed):
   sudo rm -rf /usr/local/cuda-7.x
2. Make sure PATH and LD_LIBRARY_PATH no longer contain “/usr/local/cuda-7.x”. Possible places to look are /etc/environment, ~/.profile, /etc/bash.bashrc, /etc/profile, and ~/.bashrc. If you really don’t know where the CUDA path is added to PATH or LD_LIBRARY_PATH, see: https://unix.stackexchange.com/questions/813/how-to-determine-where-an-environment-variable-came-from
3. cuda 8.0 …
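As a quick check for step 2, the following Python sketch lists any PATH or LD_LIBRARY_PATH entries that still mention an old CUDA directory. The helper name `find_stale_entries` and the marker prefix `/usr/local/cuda-7` are assumptions made for illustration. It only inspects the environment of the current process; it does not tell you which file set the variable.

```python
import os


def find_stale_entries(value, marker="/usr/local/cuda-7"):
    """Return the colon-separated entries of `value` that contain `marker`.

    `marker` is an assumed prefix for old CUDA 7.x install directories.
    """
    return [entry for entry in value.split(":") if marker in entry]


# Inspect the two variables mentioned in step 2.
for var in ("PATH", "LD_LIBRARY_PATH"):
    stale = find_stale_entries(os.environ.get(var, ""))
    print(var, "->", stale if stale else "no cuda-7.x entries")
```

Run it in a fresh login shell, since a variable exported in an already-open terminal can hide changes made to the startup files.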

English Grammar

“A” or “an” before an acronym or abbreviation? E.g., a FAQ or an FAQ? https://english.stackexchange.com/questions/1016/do-you-use-a-or-an-before-acronyms
When should I add “the” before a noun, and before what kind of noun? http://www.englishteachermelanie.com/grammar-when-not-to-use-the-definite-article/
Should “the” be repeated in “noun and noun” phrases? http://english.stackexchange.com/questions/9487/is-it-necessary-to-use-the-multiple-times
After a “noun and noun” phrase, is the following verb plural or singular? http://www.mhhe.com/mayfieldpub/tsw/nounsagr.htm
adj before “noun …

Inverse Reinforcement Learning

In my rough understanding, inverse reinforcement learning is a branch of RL research in which the goal is to produce state-action sequences that resemble given tutor sequences. There are two famous works on inverse reinforcement learning: Apprenticeship Learning via Inverse Reinforcement Learning [1] and Maximum Margin Planning [2].

Maximum Margin Planning

In …

Reinforcement learning overview

Here are some materials I found useful for learning Reinforcement Learning (RL). Let’s first look at the Markov Decision Process (MDP), in which you know a transition function $latex T(s,a,s')$ and a reward function $latex R(s,a,s')$. In the diagram below, the green state is called a “q state”. Some notations that need to be clarified: Dynamic programming …
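To make the roles of $latex T(s,a,s')$ and $latex R(s,a,s')$ concrete, here is a minimal Python sketch of the standard one-step backup through a q state, followed by value iteration, one common dynamic-programming method for a known MDP. It is not taken from the post: the two-state MDP, the dictionary layout, and the names `q_value` and `value_iteration` are all illustrative assumptions.

```python
def q_value(T, R, V, s, a, gamma):
    """Expected return of the q state (s, a): sum over s' of T * (R + gamma * V)."""
    return sum(p * (R[(s, a, s2)] + gamma * V[s2]) for s2, p in T[(s, a)].items())


def value_iteration(states, actions, T, R, gamma=0.5, sweeps=100):
    """Repeatedly back up V(s) = max_a Q(s, a) for a fixed number of sweeps."""
    V = {s: 0.0 for s in states}
    for _ in range(sweeps):
        V = {s: max(q_value(T, R, V, s, a, gamma) for a in actions) for s in states}
    return V


# Hypothetical toy MDP: T maps (s, a) to {s': probability}.
states, actions = ["s0", "s1"], ["stay", "go"]
T = {
    ("s0", "stay"): {"s0": 1.0}, ("s0", "go"): {"s1": 1.0},
    ("s1", "stay"): {"s1": 1.0}, ("s1", "go"): {"s0": 1.0},
}
# Reward 1 for landing in s1, 0 otherwise.
R = {
    ("s0", "stay", "s0"): 0.0, ("s0", "go", "s1"): 1.0,
    ("s1", "stay", "s1"): 1.0, ("s1", "go", "s0"): 0.0,
}
V = value_iteration(states, actions, T, R)
print(V)
```

With a discount of 0.5 the values can be checked by hand: staying in s1 forever is worth 1 + 0.5 + 0.25 + ... = 2, and going from s0 to s1 is worth 1 + 0.5 * 2 = 2.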

Abstract Algebra

This post introduces some basic definitions from abstract algebra: structures such as monoids, groups, rings, fields, and vector spaces, as well as homomorphisms and isomorphisms. I found clear definitions of these structures in [1]: Also, the tables below show a clear comparison between several structures [2,3]:   All these structures are defined with both a set and operation(s). Based on [4], …
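Since each structure is a set together with one or more operations, the axioms can be checked by brute force on a small finite example. The sketch below uses the standard textbook axioms for monoids and groups, which may be phrased differently in [1]; the function names and the choice of the integers modulo 5 are assumptions made for illustration.

```python
from itertools import product


def is_closed(elements, op):
    return all(op(a, b) in elements for a, b in product(elements, repeat=2))


def is_associative(elements, op):
    return all(op(op(a, b), c) == op(a, op(b, c))
               for a, b, c in product(elements, repeat=3))


def find_identity(elements, op):
    """Return the two-sided identity element, or None if there is none."""
    for e in elements:
        if all(op(e, a) == a and op(a, e) == a for a in elements):
            return e
    return None


def classify(elements, op):
    """Classify (elements, op) as 'group', 'monoid', or 'not a monoid'."""
    if not (is_closed(elements, op) and is_associative(elements, op)):
        return "not a monoid"
    e = find_identity(elements, op)
    if e is None:
        return "not a monoid"
    # A group additionally needs a two-sided inverse for every element.
    invertible = all(any(op(a, b) == e and op(b, a) == e for b in elements)
                     for a in elements)
    return "group" if invertible else "monoid"


Z5 = set(range(5))
print(classify(Z5, lambda a, b: (a + b) % 5))  # addition modulo 5
print(classify(Z5, lambda a, b: (a * b) % 5))  # multiplication modulo 5
```

Addition modulo 5 satisfies every group axiom, while multiplication modulo 5 is only a monoid because 0 has no multiplicative inverse.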

When the A* algorithm returns an optimal solution

Dijkstra’s algorithm is a well-known algorithm for finding the exact shortest distance from a source to a destination. To speed up path finding, the A* algorithm combines a heuristic with the distances known so far to find the heuristically best path towards a goal. A common A* implementation maintains an open set for discovered but not yet evaluated nodes and a closed …
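The open-set and closed-set bookkeeping can be sketched in Python as follows. This is a generic illustration, not the implementation discussed in the post: the graph, the heuristic values, and the name `a_star` are assumptions. The example heuristic is consistent (it never drops by more than the edge cost along any edge), which is a standard condition under which a variant that never reopens closed nodes still returns a shortest path.

```python
import heapq


def a_star(graph, heuristic, start, goal):
    """Return the cost of the path found from start to goal, or None.

    graph: dict mapping node -> list of (neighbor, edge_cost)
    heuristic: dict mapping node -> estimated remaining cost to goal
    """
    open_heap = [(heuristic[start], start)]  # entries are (f = g + h, node)
    g = {start: 0}                           # best known distance from start
    closed = set()                           # nodes already evaluated
    while open_heap:
        _, node = heapq.heappop(open_heap)
        if node == goal:
            return g[node]
        if node in closed:
            continue  # stale heap entry for an already evaluated node
        closed.add(node)
        for neighbor, cost in graph.get(node, []):
            new_g = g[node] + cost
            if neighbor not in closed and new_g < g.get(neighbor, float("inf")):
                g[neighbor] = new_g
                heapq.heappush(open_heap, (new_g + heuristic[neighbor], neighbor))
    return None


# Hypothetical graph: the shortest path A -> B -> C -> D costs 3.
graph = {
    "A": [("B", 1), ("C", 4)],
    "B": [("C", 1), ("D", 5)],
    "C": [("D", 1)],
    "D": [],
}
heuristic = {"A": 3, "B": 2, "C": 1, "D": 0}
print(a_star(graph, heuristic, "A", "D"))
```

Setting every heuristic value to 0 turns the same code into Dijkstra’s algorithm, which is a convenient way to compare the two on a given graph.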

Install TensorFlow 0.12 with GPU support on AWS p2 instance

# for connection and file transfer
ssh -i ~/Dropbox/research/aws_noisemodel_keypair.pem ubuntu@ec2-54-164-130-227.compute-1.amazonaws.com
rsync --progress --delete -rave "ssh -i /home/czxttkl/Dropbox/research/aws_noisemodel_keypair.pem" /home/czxttkl/workspace/mymachinelearning/Python/LoLSynergyCounter ubuntu@ec2-54-164-130-227.compute-1.amazonaws.com:~/
sudo apt-get install python-pip python-dev
pip install tensorflow-gpu

# download and transfer cuda toolkit, then install
sudo dpkg -i cuda-repo-ubuntu1604-8-0-local_8.0.44-1_amd64.deb
sudo apt-get update
sudo apt-get install cuda

# download and transfer cudnn, then install:
tar xvzf cudnn-<your-version>.tgz
sudo …