/**
 * Emits the host or domain of this record's URL as the key, and the node's
 * inlink count, outlink count, or inlink score as the value.
 */
public void map(Text key, Node node,
    OutputCollector<Text, FloatWritable> output, Reporter reporter)
    throws IOException {

  // Pick the value to emit; the inlink score is the fallback when neither
  // the inlinks nor the outlinks flag is set.
  float number = 0;
  if (inlinks) {
    number = node.getNumInlinks();
  } else if (outlinks) {
    number = node.getNumOutlinks();
  } else {
    number = node.getInlinkScore();
  }

  // Rewrite the key in place: the URL becomes its host or its domain name.
  if (host) {
    key.set(URLUtil.getHost(key.toString()));
  } else {
    key.set(URLUtil.getDomainName(key.toString()));
  }
  output.collect(key, new FloatWritable(number));
}
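The two decisions the mapper makes (which value to emit, and how to shorten the URL key) can be tried without a Hadoop or Nutch setup. The following is only a minimal sketch under stated assumptions: `NodeDumpSketch`, `SimpleNode`, `Metric`, `selectValue` and `hostKey` are hypothetical names that do not exist in the original project, and the host is extracted with the JDK's `java.net.URI` rather than with `URLUtil`. Domain-name extraction is left out, because it depends on a suffix lookup that this sketch does not attempt to reproduce.

```java
import java.net.URI;
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;

public class NodeDumpSketch {

    /** Hypothetical replacement for the inlinks/outlinks/score flags. */
    public enum Metric { INLINKS, OUTLINKS, SCORE }

    /** Hypothetical stand-in for Node: only the three fields the mapper reads. */
    public static final class SimpleNode {
        final int numInlinks;
        final int numOutlinks;
        final float inlinkScore;

        public SimpleNode(int numInlinks, int numOutlinks, float inlinkScore) {
            this.numInlinks = numInlinks;
            this.numOutlinks = numOutlinks;
            this.inlinkScore = inlinkScore;
        }
    }

    /** Same branching as the mapper: inlinks, else outlinks, else inlink score. */
    public static float selectValue(SimpleNode node, Metric metric) {
        switch (metric) {
            case INLINKS:
                return node.numInlinks;
            case OUTLINKS:
                return node.numOutlinks;
            default:
                return node.inlinkScore;
        }
    }

    /** Assumption: a lower-cased URI host is a good enough key for this sketch. */
    public static String hostKey(String url) {
        String host = URI.create(url).getHost();
        return host == null ? null : host.toLowerCase(Locale.ROOT);
    }

    public static void main(String[] args) {
        Map<String, SimpleNode> nodes = new LinkedHashMap<>();
        nodes.put("http://example.com/a", new SimpleNode(3, 5, 0.25f));
        nodes.put("http://news.example.org/b", new SimpleNode(1, 2, 0.5f));

        // Print one (host, value) pair per record, as the mapper would collect it.
        for (Map.Entry<String, SimpleNode> e : nodes.entrySet()) {
            System.out.println(hostKey(e.getKey()) + "\t"
                + selectValue(e.getValue(), Metric.INLINKS));
        }
    }
}
```

Unlike the mapper, which reuses and overwrites the incoming `Text` key, the sketch returns a new string, which keeps the helper free of side effects and easy to test.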
NodeDumper.java source file
Project: GeoCrawler