Avaliação do Controle de Acesso de Múltiplos Usuários e Múltiplos Arquivos em um Ambiente Hadoop
Abstract
Massive processing of data is a reality for several computer systems. The security of processed data has great importance since the environment is typically shared among multiple users. This article presents an evaluation of the access control of multiple users and multiple files, considering the different control levels of a Hadoop environment (operating system, distributed file system and web interface). A test scenario is proposed and validated at different levels and different versions of a Hadoop distribution (Hortonworks). The versions presented the same behavior but we identified errors and differences between control levels.
References
Hadoop. “The Apache Hadoop.” http://hadoop.apache.org/.
HadoopIssues. “Hadoop Issues Tracking.” https://issues.apache.org/jira/browse/HADOOP.
HDFS. “Hadoop Distributed File System.” http://hadoop.apache.org/hdfs/.
HDFSIssues. “HDFS Issues Tracking.” https://issues.apache.org/jira/browse/HDFS.
Hortonworks. “Hortonworks: Open Enterprise Hadoop.” http://hortonworks.com.
Hue. “Hue - Hadoop User Experience - The Apache Hadoop UI.” http://gethue.com/.
Shvachko, Konstantin, Hairong Kuang, Sanjay Radia, and Robert Chansler. 2010. “The Hadoop Distributed File System.” In Proc. of the MSST - Symp. on Mass Storage Systems and Technologies, IEEE, 1–10.
Tabatabaei, Mahsa. 2014. “Evaluation of Security in Hadoop.” KTH Royal Institute of Technology.
Tankard, Colin. 2012. “Big Data Security.” Network Security 2012(7): 5–8.
Thusoo, Ashish, J.S. Sarma, Namit Jain, Zheng Shao, Prasad Chakka, Suresh Anthony, Hao Liu, Pete Wyckoff, and Raghotham Murthy. 2009. “Hive - A Warehousing Solution Over a Map-Reduce Framework.” Proceedings of the VLDB Endowment 2(2): 1626–29.
White, Tom. 2012. Hadoop: The Definitive Guide, 3rd Edition. 3rd ed. O’Reilly Media.
Zikopoulos, Paul C., Chris Eaton, Dirk DeRoos, Thomas Deutsch, and George Lapis. 2012. Understanding Big Data: Analytics for Enterprise Class Hadoop and Streaming Data. McGraw-Hill.
