A Framework for Dimensioning the Total Population in Networks Composed of Rare and Clustered Populations

  • Camila D. da Silva UFF
  • Antonio A. de A. Rocha UFF

Abstract


Dimensioning the total population in networks whose structure is composed of a rare and clustered population is not a trivial task. In these circumstances, it is usual to overlay a grid over the region in which the population is contained and select cells from that grid. A framework called the 2-Layer and 2-Estimator Method - 2L2EM was proposed and it implements the Multiple Capture and Recapture Method - MCRM in layer 1 to obtain the estimates within the cell to be used as input for the Adaptive Cluster Sampling - ACS which is layer 2 in which the total population in the network is estimated. The studies with synthetic and real data reveal that 2L2EM provides relevant estimates in relation to ACS and MCRM.

Keywords: Multiple Capture and Recapture Method, Adaptive Cluster Sampling, Horvitz-Thompson Estimator, Hansen-Hurwitz Estimator, Schnabel Estimator

References

Accettura, N., Neglia, G., and Grieco, L. A. (2015). The capture-recapture approach for population estimation in computer networks. Computer Networks, 89:107–122.

Dalal, S. R. and Mallows, C. L. (1990). Some graphical aids for deciding when to stop testing software. IEEE Journal on Selected Areas in Communications, 8(2):169–175.

El Emam, K. and Laitenberger, O. (2001). Evaluating capture-recapture models with two inspectors. IEEE Transactions on Software Engineering, 27(9):851–864.

Hansen, M. H. and Hurwitz, W. N. (1943). On the theory of sampling from finite populations. The Annals of Mathematical Statistics, 14(4):333–362.

Horvitz, D. G. and Thompson, D. J. (1952). A generalization of sampling without replacement from a finite universe. Journal of the American Statistical Association, 47(260):663–685.

Peng, S., Li, S., et al. (2009). Estimation of a population size in large-scale wireless sensor networks. Journal of Computer Science and Technology, 24(5):987–997.

Schnabel, Z. E. (1938). The estimation of the total fish population of a lake. The American Mathematical Monthly, 45(6):348–352.

Singham, D. I. (2010). Analysis of Sequential Stopping Rules for Simulation Experiments. PhD thesis, University of California, Berkeley, USA.

Smith, P. J. (1988). Bayesian methods for multiple capture-recapture surveys. Biometrics, pages 1177–1189.

Thompson, S. K. (1990). Adaptive cluster sampling. Journal of the American Statistical Association, 85(412):1050–1059.

Turk, P. and Borkowski, J. J. (2005). A review of adaptive cluster sampling: 1990–2003. Environmental and Ecological Statistics, 12(1):55–94.
Published
2022-07-31
SILVA, Camila D. da; ROCHA, Antonio A. de A.. A Framework for Dimensioning the Total Population in Networks Composed of Rare and Clustered Populations. In: WORKSHOP ON PERFORMANCE OF COMPUTER AND COMMUNICATION SYSTEMS (WPERFORMANCE), 21. , 2022, Niterói. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2022 . p. 60-71. ISSN 2595-6167. DOI: https://doi.org/10.5753/wperformance.2022.223259.