+6 votes
in Big Data Hadoop & Spark by (1.5k points)

Can someone explain the basic difference that distinguishes s3n, s3a and s3 in Hadoop? Technically how are they different?

2 Answers

+15 votes
by (13.2k points)

S3 is a block-based overlay on top of Amazon S3, whereas  S3N/S3A are not.

These are are object-based. The difference between S3 and S3N/S3A is that S3 is a block-based overlay on top of Amazon S3, while S3N or the S3A are not because of them being more object based.

S3A is the successor to S3N. The difference between S3N and S3A is that S3N supports objects up to 5GB in size, while S3A supports objects within 5TB and it has much higher performance.

0 votes
by (33.2k points)
s3n is for use with files that are natively collected in S3 as individual S3 objects whereas s3 is used for accessing files that are collected within HDFS blocks and those blocks automatically are stored or backed in S3.

For more information regarding the same, refer the following video:

Welcome to Intellipaat Community. Get your technical queries answered by top developers !