Define hadoop archives? What is the command for archiving a group of files in hdfs.
Answer / Uma
Hadoop Archives (ARC) are an extension to the standard Hadoop file system that allow bundling of multiple files and directories into a single archived unit. This can be useful for managing large numbers of related files. To create an archive, use the command `hadoop archives -create arc_file path/to/files`. To extract an archive, use `hadoop archives -extract arc_file path/to/destination`.
| Is This Answer Correct ? | 0 Yes | 0 No |
HDFS is used for applications with large data sets, not why Many small files?
What is a block?
Why HDFS stores data using commodity hardware despite the higher chance of failures?
What is the problem in having lots of small files in hdfs?
How can one set space quota in Hadoop (HDFS) directory?
What is a namenode in hadoop?
Why is block size set to 128 MB in Hadoop HDFS?
Tell me two most commonly used commands in HDFS?
If a particular file is 50 mb, will the hdfs block still consume 64 mb as the default size?
What is Secondary NameNode in Hadoop HDFS?
What are file permissions in HDFS? how does HDFS check permissions for files/directory?
Write the command to copy a file from linux to hdfs?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)