MH-1M: One of The Most Comprehensive and Up-to-Date Dataset for Advanced Android Malware Detection

  • Hendrio Bragança UFAM
  • Vanderson Rocha UFAM
  • Joner Assolin UFAM
  • Diego Kreutz UNIPAMPA
  • Eduardo Feitosa UFAM


We introduce MH-1M, one of the most comprehensive and up-to-date datasets for advanced Android malware research. The dataset comprises 1,340,515 applications, covering diverse features and extensive metadata. For precise malware assessment, we use the VirusTotal API, integrating multiple detection methods to ensure reliable outcomes. Our GitHub repository provides access to the processed dataset and associated metadata, totaling over 400 GB, including the full outputs of the feature extraction process and the VirusTotal metadata files. Our findings underscore the role of MH-1M as a valuable resource for understanding the evolving malware landscape.


BRAGANÇA, Hendrio; ROCHA, Vanderson; ASSOLIN, Joner; KREUTZ, Diego; FEITOSA, Eduardo. MH-1M: One of The Most Comprehensive and Up-to-Date Dataset for Advanced Android Malware Detection. In: SIMPÓSIO BRASILEIRO DE SEGURANÇA DA INFORMAÇÃO E DE SISTEMAS COMPUTACIONAIS (SBSEG), 24., 2024, São José dos Campos/SP. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2024. p. 843-849. DOI:

