public class WholeTextFileInputFormat
extends org.apache.hadoop.mapreduce.lib.input.CombineFileInputFormat<String,String>
implements org.apache.hadoop.conf.Configurable
A `CombineFileInputFormat` for reading whole text files. Each file is read as a single key-value pair, where the key is the file path and the value is the entire content of the file.

| Constructor and Description |
|---|
| `WholeTextFileInputFormat()` |
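As a minimal sketch of the contract described above, in plain Java with no Hadoop or Spark dependencies (`readWholeFiles` is a hypothetical helper, not part of this API): each file in a directory becomes one (path, entire-content) pair.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Map;
import java.util.stream.Collectors;
import java.util.stream.Stream;

public class WholeFilesDemo {
    // Hypothetical helper mirroring the key-value contract of
    // WholeTextFileInputFormat: key = file path, value = whole file content.
    static Map<String, String> readWholeFiles(Path dir) throws IOException {
        try (Stream<Path> files = Files.list(dir)) {
            return files.filter(Files::isRegularFile)
                        .collect(Collectors.toMap(
                            Path::toString,        // key: file path
                            p -> {                 // value: entire content
                                try {
                                    return Files.readString(p);
                                } catch (IOException e) {
                                    throw new UncheckedIOException(e);
                                }
                            }));
        }
    }

    public static void main(String[] args) throws IOException {
        Path dir = Files.createTempDirectory("whole-files-demo");
        Files.writeString(dir.resolve("a.txt"), "hello\nworld\n");
        Files.writeString(dir.resolve("b.txt"), "spark");
        Map<String, String> pairs = readWholeFiles(dir);
        System.out.println(pairs.size());                               // prints 2
        System.out.println(pairs.get(dir.resolve("b.txt").toString())); // prints spark
    }
}
```

Unlike a line-oriented input format, no record boundary is ever placed inside a file; small files are instead combined into larger splits by the `CombineFileInputFormat` machinery.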
| Modifier and Type | Method and Description |
|---|---|
| `org.apache.hadoop.mapreduce.RecordReader<String,String>` | `createRecordReader(org.apache.hadoop.mapreduce.InputSplit split, org.apache.hadoop.mapreduce.TaskAttemptContext context)` |
| `org.apache.hadoop.conf.Configuration` | `getConf()` |
| `void` | `setConf(org.apache.hadoop.conf.Configuration c)` |
| `void` | `setMinPartitions(org.apache.hadoop.mapreduce.JobContext context, int minPartitions)` Allows `minPartitions` to be set by the end user, for compatibility with the old Hadoop API, where this is set through `setMaxSplitSize`. |
public void setConf(org.apache.hadoop.conf.Configuration c)

Specified by: `setConf` in interface `org.apache.hadoop.conf.Configurable`

public org.apache.hadoop.conf.Configuration getConf()

Specified by: `getConf` in interface `org.apache.hadoop.conf.Configurable`

public org.apache.hadoop.mapreduce.RecordReader<String,String> createRecordReader(org.apache.hadoop.mapreduce.InputSplit split, org.apache.hadoop.mapreduce.TaskAttemptContext context)

Specified by: `createRecordReader` in class `org.apache.hadoop.mapreduce.lib.input.CombineFileInputFormat<String,String>`

public void setMinPartitions(org.apache.hadoop.mapreduce.JobContext context, int minPartitions)

Allows `minPartitions` to be set by the end user, for compatibility with the old Hadoop API, where this is set through `setMaxSplitSize`.
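The compatibility shim above can be pictured as translating a requested minimum partition count into a maximum combined-split size. A rough sketch follows; the helper name, rounding, and zero-guard are assumptions for illustration, not the actual implementation:

```java
public class MinPartitionsSketch {
    // Hypothetical illustration (not the actual Spark source): to produce at
    // least minPartitions splits, cap each combined split at roughly
    // totalInputBytes / minPartitions -- the value that would be handed to
    // CombineFileInputFormat.setMaxSplitSize.
    static long maxSplitSizeFor(long totalInputBytes, int minPartitions) {
        int parts = Math.max(minPartitions, 1);  // guard against 0
        return (long) Math.ceil(totalInputBytes / (double) parts);
    }

    public static void main(String[] args) {
        // 10 MB of input with at least 4 partitions -> splits of at most 2.5 MB
        System.out.println(maxSplitSizeFor(10_000_000L, 4)); // prints 2500000
    }
}
```

Because each whole file must land in exactly one split, the achieved partition count is only approximately `minPartitions` when file sizes are uneven.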