Irrelevant frame removal for scene analysis using video  hyperclique pattern and spectrum analysis

Yuchou Chang; Hong Lin

doi:10.14419/jacst.v5i1.4035

Authors and Affiliations

Yuchou Chang University of Wisconsin - Milwaukee
Hong Lin Yale University

About this article

DOI:

https://doi.org/10.14419/jacst.v5i1.4035

Received:

16-12-2014

Accepted:

12-01-2015

Published:

06-02-2016

Views:

542

Downloads:

246

Download PDF

Keywords:

Content-Based Video Retrieval, Fibonacci Lattice, Hyperclique Pattern, Irrelevant Removal, Log Spectrum.

Abstract

Video often include frames that are irrelevant to the scenes for recording. These are mainly due to imperfect shooting, abrupt movements of camera, or unintended switching of scenes. The irrelevant frames should be removed before the semantic analysis of video scene is performed for video retrieval. An unsupervised approach for automatic removal of irrelevant frames is proposed in this paper. A novel log-spectral representation of color video frames based on Fibonacci lattice-quantization has been developed for better description of the global structures of video contents to measure similarity of video frames. Hyperclique pattern analysis, used to detect redundant data in textual analysis, is extended to extract relevant frame clusters in color videos. A new strategy using the k-nearest neighbor algorithm is developed for generating a video frame support measure and an h-confidence measure on this hyperclique pattern based analysis method. Evaluation of the proposed irrelevant video frame removal algorithm reveals promising results for datasets with irrelevant frames.

Author Biography

Yuchou Chang, University of Wisconsin - Milwaukee

I am a PhD candidate in Electrical Engineering and Computer Science Department, University of Wisconsin - Milwaukee, USA. My research areas include pattern recognition, image processing, computer vision, biomedical imaging, machine learning, signal processing. I have around 30 peer-reviewed publications.

References

[1] A.Angelova, â€œData Pruningâ€, Thesis, California Institute of Technology, (2004).

[2] A. Angelova, Y. Abu-Mostafa, and P. Perona, â€œPruning Training Sets for Learning of Object Categoriesâ€, IEEE International Conference on Computer Vision and Pattern Recognition, vol.1, (2005), pp.494-501. http://dx.doi.org/10.1109/cvpr.2005.283.

[3] D. Angluin, and P. Laird, â€œLearning from Noisy Examplesâ€, Machine Learning, vol.2, no.4, (1988), pp.343-370. http://dx.doi.org/10.1007/BF00116829.

[4] R.A. Baeza-Yates, and B. Ribeiro-Neto, â€œModern Information Retrievalâ€, Addison-Wesley Longman Publishing, (1999).

[5] S. Basu, M. Naphade, and J.R. Smith, â€œA Statistical Modeling Approach to Content Based Retrievalâ€, IEEE International Conference on Acoustics, Speech, and Signal Processing, vol.4, (2002), pp.480-483. http://dx.doi.org/10.1109/icassp.2002.5745554.

[6] Brodley, C.E., and Friedl, M.A., Identifying Mislabeled Training Data, Journal of Artificial Intelligence Research, vol.11, pp.131-167, 1999.

[7] Y. Chi, and M.K.H. Leung, â€œPart-Based Object Retrieval in Cluttered Environmentâ€, IEEE Transactions on Analysis and Machine Intelligence, vol.29, no.5, (2007), pp.890-895. http://dx.doi.org/10.1109/TPAMI.2007.1076.

[8] B.V. Dasarathy, â€œNearest Neighbor (NN) Norms: NN Pattern Classification Techniquesâ€, IEEE Computer Society Press, (1990).

[9] T. Deselaers, D. Keysers, and H. Ney, â€œClustering Visually Similar Images to Improve Image Search Enginesâ€, Informatiktage 2003 der Gesellschaft fÃ¼r Informatik, Germany, (2003).

[10] L.Y. Duan, M. Xu, Q. Tian, C.S. Xu, and J.S. Jin, â€œA Unified Framework for Semantic Shot Classification in Sports Videoâ€, IEEE Transactions on Multimedia, vol.7, no. 6, (2005), pp.1066-1083. http://dx.doi.org/10.1109/TMM.2005.858395.

[11] L. Ertoz, M. Steinbach, and V. Kumar, â€œFinding Clusters of Different Sizes, Shapes and Densities in Noisy, High Dimensional Dataâ€, Proceedings of SIAM International Conference on Data Mining, (2003). http://dx.doi.org/10.1137/1.9781611972733.5.

[12] M. Fischler, and R. Bolles, â€œRandom Sample Consensus: A Paradigm for Model Fitting with Applications to Image Analysis and Automated Cartographyâ€, Communications of the ACM, vol.24, no.6, (1981), pp.381-395. http://dx.doi.org/10.1145/358669.358692.

[13] S. Guha, R. Rastogi, and K. Shim, â€œCure: An Efficient Clustering Algorithm for Large Databasesâ€, In Proceedings of ACM SIGMOD International Conference on Management of Data, (1998), pp.73-84. http://dx.doi.org/10.1145/276304.276312.

[14] A.N. Hirani, and T. Totsuka, â€œCombining Frequency and Spatial Domain Information for Fast Interactive Image Noise Removalâ€, ACM International Conference on Computer Graphics and Interactive Techniques (SIGGRAPH), (1996), pp.269-276. http://dx.doi.org/10.1145/237170.237264.

[15] A.K. Jain, â€œFundamentals of Digital Image Processingâ€, Prentice Hall Information and System Sciences Series, (1989).

[16] A.K. Jain, and R.C. Dubes, â€œAlgorithms for Clustering Dataâ€, Prentice Hall, (1998).

[17] K. Kaewbuadee, Y. Temtanapat, and R. Peachavanish, â€œData Cleaning Using FD From Data Mining Processâ€, IADIS International Journal on Computer Science and Information System, vol.1, (2006), pp.117-131.

[18] M. Kearns, and M. Li, â€œLearning in the Presence of Malicious Errorsâ€, Annual ACM Symposium on Theory of Computing, (1988), pp.267-280. http://dx.doi.org/10.1145/62212.62238.

[19] L.J. Li, G. Wang, and F.F. Li, â€œOPTIMOL: Automatic Online Picture Collection via Incremental Model Learningâ€, IEEE International Conference on Computer Vision and Pattern Recognition, (2007), pp.1-8. http://dx.doi.org/10.1109/cvpr.2007.383048.

[20] D. Makris, and T. Ellis, â€œLearning Semantic Scene Models From Observing Activity in Visual Surveillanceâ€, IEEE Transactions on Systems, Man, and Cybernetics-Part B, vol.35, no.3, (2005), pp.397-408. http://dx.doi.org/10.1109/TSMCB.2005.846652.

[21] A. Mojsilovic, and E. Soljanin, â€œColor Quantization and Processing by Fibonacci Latticesâ€, IEEE Transactions on Image Processing, vol.10, no.11, (2001), pp.1712-1725. http://dx.doi.org/10.1109/83.967399.

[22] A. Oliva, and A. Torralba, â€œModeling the Shape of the Scene: A Holistic Representation of the Spatial Envelopeâ€, International Journal of Computer Vision, vol.42, no.3, (2001), pp.145-175. http://dx.doi.org/10.1023/A:1011139631724.

[23] J. Sander, M. Ester, H.P. Kriegel, and X. Xu, â€œDensity-Based Clustering in Spatial Databases: The Algorithm GDBSCAN and Its Applicationsâ€, International Journal of Data Mining and Knowledge Discovery, vol.2, no.2, (1998), pp.169-194. http://dx.doi.org/10.1023/A:1009745219419.

[24] G. Shakhnarovich, T. Darrell, and P. Indyk, â€œNearest-Neighbor Methods in Learning and Vision: Theory and Practiceâ€, MIT Press, (2005).

[25] C.G.M. Snoek, M. Worring, D.C. Koelma, and A.W.M. Smeulders, â€œA Learned Lexicon-Driven Paradigm for Interactive Video Retrievalâ€, IEEE Transactions on Multimedia, vol.9, no. 2, (2007), pp.280-292. http://dx.doi.org/10.1109/TMM.2006.886275.

[26] C.G.M. Snoek, B. Huurnink, L. Hollink, M. de Rijke, G. Schreiber, and M. Worring, â€œAdding Semantics to Detectors for Video Retrievalâ€, IEEE Transactions on Multimedia, vol.9, no.5, (2007), pp.975-986. http://dx.doi.org/10.1109/TMM.2007.900156.

[27] D.L. Swets, and J. Weng, â€œHierarchical Discriminant Analysis for Image Retrievalâ€, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.21, no.5, (1999), pp.386-401. http://dx.doi.org/10.1109/34.765652.

[28] A. Torralba, and A. Oliva, â€œDepth Estimation from Image Structureâ€, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.24, no. 9, (2002), pp.1226-1238. http://dx.doi.org/10.1109/TPAMI.2002.1033214.

[29] A. Torralba, and A. Oliva, â€œStatistics of Natural Image Categoriesâ€, Network: Computation in Neural Systems, vol.14, no.3, (2003), pp.391-412. http://dx.doi.org/10.1088/0954-898X_14_3_302.

[30] A. Torralba, â€œModeling Global Scene Factors in Attentionâ€, Journal of the Optical Society of America, vol.20, no.7, (2003), pp.1407-pp.1418.

[31] TRECVID, available online: http://www-nlpir.nist.gov/projects/trecvid/

[32] C.J. Wu, H.C. Zeng, S.H. Huang, S.H. Lai, and W.H. Wang, â€œLearning-Based Interactive Video Retrieval Systemâ€, IEEE International Conference on Multimedia and Expo, (2006), pp.1785-1788. http://dx.doi.org/10.1109/icme.2006.262898.

[33] H. Xiong, P.N. Tan, and V. Kumar, â€œMining Strong Affinity Association Patterns in Data Sets with Skewed Support Distributionâ€, IEEE International Conference on Data Mining, (2003), pp.387-394. http://dx.doi.org/10.1109/ICDM.2003.1250944.

[34] H. Xiong, G. Pandey, M. Steinbach, and V. Kumar, â€œEnhancing Data Analysis with Noise Removalâ€, IEEE Transactions on Knowledge and Data Engineering, vol.18, no.3, (2006), pp.304-319. http://dx.doi.org/10.1109/TKDE.2006.46.

[35] T. Zhang, R. Ramakrishnan, and M. Livny, â€œBIRCH: An Efficient Data Clustering Method for Very Large Databasesâ€, ACM SIGMOD International Conference on Management of Data, (1996), pp.103-114. http://dx.doi.org/10.1145/233269.233324.

How to Cite

Chang, Y., & Lin, H. (2016). Irrelevant frame removal for scene analysis using video hyperclique pattern and spectrum analysis. Journal of Advanced Computer Science & Technology, 5(1), 1-7. https://doi.org/10.14419/jacst.v5i1.4035

Download Citation

Irrelevant frame removal for scene analysis using video hyperclique pattern and spectrum analysis