Statistical Analysis of E-Mail Traffic for the Design of a High-Performance Cluster

  • Antonio Carlos Oliveira Amorim, IPT
  • Daniel Kobayashi Imori, Bastion Systems
  • Sérgio Takeo Kofuji, USP
  • Vidal Zapparoli Melo, USP

Abstract


This paper presents a statistical analysis of e-mail traffic and of user behavior, based on the utilization of real servers. The work is part of a specification project, under development at the Institute for Technological Research (IPT, Brazil), for a highly scalable and highly available e-mail service (more than 100,000 users). The statistical results are an important step toward evaluating system performance, since they allow realistic workload simulations to be implemented. The collected data fall into three categories: the number of messages per user, the time interval between messages, and the size of each message. Time intervals and message sizes are calculated according to a model that uses probability density functions fitted to the real data and scaled to the desired number of users. To carry out the simulations, Java applications will be implemented to generate messages using the results that best fit the realistic workload. The work intends to simulate the requests generated by users, to measure system capacity as a function of the number and performance of servers, and to characterize server utilization patterns and user behavior in order to determine the factors that limit performance.
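The generator idea described above can be illustrated with a minimal Java sketch. It is not the authors' implementation: the abstract does not say which probability density functions were fitted, so the exponential inter-arrival times and log-normal message sizes used here are assumptions chosen only for illustration. The class name `WorkloadSketch`, its method names, and all numeric parameters (20 messages per user per day, `mu = 8.0`, `sigma = 1.5`) are hypothetical.

```java
import java.util.Random;

public class WorkloadSketch {

    /**
     * Draws one inter-arrival time (seconds) by inverse-transform sampling.
     * ASSUMPTION: exponential distribution; the paper's fitted PDF may differ.
     */
    public static double nextInterArrivalSeconds(Random rng, double ratePerSecond) {
        // 1 - nextDouble() lies in (0, 1], so the logarithm is always finite.
        return -Math.log(1.0 - rng.nextDouble()) / ratePerSecond;
    }

    /**
     * Draws one message size (bytes).
     * ASSUMPTION: log-normal distribution with hypothetical parameters mu and sigma.
     */
    public static long nextMessageSizeBytes(Random rng, double mu, double sigma) {
        double size = Math.exp(mu + sigma * rng.nextGaussian());
        return Math.max(1L, Math.round(size)); // never emit an empty message
    }

    /** Scales a per-user daily message count to an aggregate rate in messages per second. */
    public static double aggregateRate(int users, double messagesPerUserPerDay) {
        return users * messagesPerUserPerDay / 86_400.0;
    }

    public static void main(String[] args) {
        Random rng = new Random(42); // fixed seed so runs are repeatable
        // Hypothetical workload: 100,000 users sending 20 messages per day each.
        double rate = aggregateRate(100_000, 20.0);
        double clock = 0.0;
        for (int i = 0; i < 5; i++) {
            clock += nextInterArrivalSeconds(rng, rate);
            long size = nextMessageSizeBytes(rng, 8.0, 1.5);
            System.out.printf("t=%.4f s, size=%d bytes%n", clock, size);
        }
    }
}
```

Scaling to a target population is done here by multiplying the per-user rate by the number of users, which is one simple reading of "scaled to the desired number of users". A fitted empirical model could replace either sampler without changing the generation loop.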

Published
24/10/2005
AMORIM, Antonio Carlos Oliveira; IMORI, Daniel Kobayashi; KOFUJI, Sérgio Takeo; MELO, Vidal Zapparoli. Análise Estatística do Trafego de E-Mail para o Projeto de um Cluster de Alto Desempenho. In: SIMPÓSIO EM SISTEMAS COMPUTACIONAIS DE ALTO DESEMPENHO (SSCAD), 6., 2005, Rio de Janeiro. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2005. p. 17-24. DOI: https://doi.org/10.5753/wscad.2005.18971.