|
Nguyen-Khang PHAM Lecturer Can Tho University College of information technology 1 Ly Tu Trong, Can Tho City E-mail : pnkhang@cit.ctu.edu.vn |
I. SCRIPTS 1. image2sift.sh Compute SIFT for an image 2. dir2sift.sh Compute SIFT for a directory of images 3. sampling.sh Sample a subset of SIFT to create visual words using k-means Syntax: sampling.sh "SIFT-DIR" "RESULT-FILE" [size] -SIFT-DIR: directory containing SIFT file (output of dir2sift) -RESULT-FILE: file containing SIFT -SIZE: number of images used for sampling, by fault, 1/5 images in SIFT-DIR will be used 4. all-steps.sh: all of preprocessing steps used to create data for visualization II. BINARIES 1. extract_features.ln Compute SIFT for an image (program of Krystian.Mikolajczyk) 2. kmeans, kmeans-mt: k-means algorithm implemented by PHAM Nguyen-Khang Syntax kmeans [OPTION] "data file" "model file" OPTIONS: -i: Number of iterations [default: 30] -k : Number of clusters -b : Block size [number of lines to be processed per block] -m : File containing the min cluster of individus, [don't care] -p : Number of threads [default: 4] data file format: number_of_line number_of_column line1 line2 ... kmeans-mt: version multithreading of kmeans 3. assign, assign-mt: assign SIFTs of images into clusters to form the contingency table (used for Correspondence Analysis) Syntax: assign [OPTION] "list file" "model file" "result file" OPTIONS: -t : SIFT type [0: no region, 1: output of 'extract_feature' with option -o1] -d : SIFT region descriptor's directory -b : Block size [number of lines to be processed per block] -p : Number of threads [default: 4] list file: a file containing a list of filename Format: number of files (e.g. 10) file_name_1 file_name_2 ... file_name_10 Use the Linux command 'find' to create this file model file: clusters computed by k-means result file: contingency table 4. ca apply correspondence analysis on a contingency table Syntax: ca [OPTION] "data file" "output stem" OPTIONS: -k number of axes data file: contingency table output stemp: stemp for output, the output composes: _Z.txt: projection of lines _W.txt: projection of columns _P.txt: marginal probability of lines _Q.txt: marginal probability of columns _A.txt: transistion matrix for projection of lines, _B.txt: transistion matrix for projection of columns, Z = P^(-1)*F*A W = Q^(-1)*F'*B F: the contingency table, Q, P: the diagonal matrix