@Deprecated public class DistributedLanczosSolver extends LanczosSolver implements org.apache.hadoop.util.Tool
SSVDSolver| Modifier and Type | Class and Description |
|---|---|
class |
DistributedLanczosSolver.DistributedLanczosSolverJob
Deprecated.
Inner subclass of AbstractJob so we get access to AbstractJob's functionality w.r.t.
|
LanczosSolver.TimingSection| Modifier and Type | Field and Description |
|---|---|
static String |
RAW_EIGENVECTORS
Deprecated.
|
SAFE_MAX| Constructor and Description |
|---|
DistributedLanczosSolver()
Deprecated.
|
| Modifier and Type | Method and Description |
|---|---|
org.apache.hadoop.conf.Configuration |
getConf()
Deprecated.
|
static Vector |
getInitialVector(VectorIterable corpus)
Deprecated.
For the distributed case, the best guess at a useful initialization state for Lanczos we'll chose to be
uniform over all input dimensions, L_2 normalized.
|
DistributedLanczosSolver.DistributedLanczosSolverJob |
job()
Deprecated.
|
static void |
main(String[] args)
Deprecated.
|
int |
run(org.apache.hadoop.fs.Path inputPath,
org.apache.hadoop.fs.Path outputPath,
org.apache.hadoop.fs.Path outputTmpPath,
org.apache.hadoop.fs.Path workingDirPath,
int numRows,
int numCols,
boolean isSymmetric,
int desiredRank)
Deprecated.
Run the solver to produce the raw eigenvectors
|
int |
run(org.apache.hadoop.fs.Path inputPath,
org.apache.hadoop.fs.Path outputPath,
org.apache.hadoop.fs.Path outputTmpPath,
org.apache.hadoop.fs.Path workingDirPath,
int numRows,
int numCols,
boolean isSymmetric,
int desiredRank,
double maxError,
double minEigenvalue,
boolean inMemory)
Deprecated.
Run the solver to produce raw eigenvectors, then run the EigenVerificationJob to clean them
|
int |
run(String[] strings)
Deprecated.
|
LanczosState |
runJob(org.apache.hadoop.conf.Configuration originalConfig,
LanczosState state,
int desiredRank,
boolean isSymmetric,
String outputEigenVectorPathString)
Deprecated.
|
LanczosState |
runJob(org.apache.hadoop.conf.Configuration originalConfig,
org.apache.hadoop.fs.Path inputPath,
org.apache.hadoop.fs.Path outputTmpPath,
int numRows,
int numCols,
boolean isSymmetric,
int desiredRank,
String outputEigenVectorPathString)
Deprecated.
Factored-out LanczosSolver for the purpose of invoking it programmatically
|
void |
serializeOutput(LanczosState state,
org.apache.hadoop.fs.Path outputPath)
Deprecated.
|
void |
setConf(org.apache.hadoop.conf.Configuration configuration)
Deprecated.
|
calculateScaleFactor, orthoganalizeAgainstAllButLast, solve, solvepublic static final String RAW_EIGENVECTORS
public static Vector getInitialVector(VectorIterable corpus)
public LanczosState runJob(org.apache.hadoop.conf.Configuration originalConfig, LanczosState state, int desiredRank, boolean isSymmetric, String outputEigenVectorPathString) throws IOException
IOExceptionpublic LanczosState runJob(org.apache.hadoop.conf.Configuration originalConfig, org.apache.hadoop.fs.Path inputPath, org.apache.hadoop.fs.Path outputTmpPath, int numRows, int numCols, boolean isSymmetric, int desiredRank, String outputEigenVectorPathString) throws IOException
IOExceptionpublic int run(String[] strings) throws Exception
run in interface org.apache.hadoop.util.ToolExceptionpublic int run(org.apache.hadoop.fs.Path inputPath,
org.apache.hadoop.fs.Path outputPath,
org.apache.hadoop.fs.Path outputTmpPath,
org.apache.hadoop.fs.Path workingDirPath,
int numRows,
int numCols,
boolean isSymmetric,
int desiredRank,
double maxError,
double minEigenvalue,
boolean inMemory)
throws Exception
inputPath - the Path to the input corpusoutputPath - the Path to the outputoutputTmpPath - a Path to a temporary working directorynumRows - the int number of rowsnumCols - the int number of columnsisSymmetric - true if the input matrix is symmetricdesiredRank - the int desired rank of eigenvectors to producemaxError - the maximum allowable errorminEigenvalue - the minimum usable eigenvalueinMemory - true if the verification can be done in memoryExceptionpublic int run(org.apache.hadoop.fs.Path inputPath,
org.apache.hadoop.fs.Path outputPath,
org.apache.hadoop.fs.Path outputTmpPath,
org.apache.hadoop.fs.Path workingDirPath,
int numRows,
int numCols,
boolean isSymmetric,
int desiredRank)
throws Exception
inputPath - the Path to the input corpusoutputPath - the Path to the outputoutputTmpPath - a Path to a temporary working directorynumRows - the int number of rowsnumCols - the int number of columnsisSymmetric - true if the input matrix is symmetricdesiredRank - the int desired rank of eigenvectors to produceExceptionpublic void serializeOutput(LanczosState state, org.apache.hadoop.fs.Path outputPath) throws IOException
state - The final LanczosState to be serializedoutputPath - The path (relative to the current Configuration's FileSystem) to save the output to.IOExceptionpublic void setConf(org.apache.hadoop.conf.Configuration configuration)
setConf in interface org.apache.hadoop.conf.Configurablepublic org.apache.hadoop.conf.Configuration getConf()
getConf in interface org.apache.hadoop.conf.Configurablepublic DistributedLanczosSolver.DistributedLanczosSolverJob job()
Copyright © 2008–2017 The Apache Software Foundation. All rights reserved.