And with wildcards it would be:
/user/inputLocation/*
And in case of nested partitions, we can have a combination of globs and wildcards, such as:
/user/inputLocation/{Partition-1,Partition-2, Partition-3,....Partition-n}/*
Pig能夠順利從對應位置讀取數(shù)據(jù),并在操作執(zhí)行當中對其加以優(yōu)化。如此一來,Pig面臨的惟一障礙就是集群資源的不可用性。除此之外,在對數(shù)據(jù)進行大量變換的情況下,Apache Pig的表現(xiàn)可以說無與倫比。
Hive與Pig間的對決
以下信息能夠幫助大家了解Hive與Pig各自的設計思路與工作原理。