Examinando por Autor "Carmona, Enrique J."
Mostrando 1 - 20 de 32
Resultados por página
Opciones de ordenación
Publicación A block-based model for monitoring of human activity(Elsevier, 2011-03) Folgado Zuñiga, Encarnación; Carmona, Enrique J.; Rincón Zamorano, Mariano; Bachiller Mayoral, MargaritaThe study of human activity is applicable to a large number of science and technology fields, such as surveillance, biomechanics or sports applications. This article presents BB6-HM, a block-based human model for real-time monitoring of a large number of visual events and states related to human activity analysis, which can be used as components of a library to describe more complex activities in such important areas as surveillance, for example, luggage at airports, clients’ behaviour in banks and patients in hospitals. BB6-HM is inspired by the proportionality rules commonly used in Visual Arts, i.e., for dividing the human silhouette into six rectangles of the same height. The major advantage of this proposal is that analysis of the human can be easily broken down into regions, so that we can obtain information of activities. The computational load is very low, so it is possible to define a very fast implementation. Finally, this model has been applied to build classifiers for the detection of primitive events and visual attributes using heuristic rules and machine learning techniques.Publicación A block-based model for monitoring of human activity(Elsevier, 2011-03) Folgado Zuñiga, Encarnación; Carmona, Enrique J.; Rincón Zamorano, Mariano; Bachiller Mayoral, MargaritaThe study of human activity is applicable to a large number of science and technology fields, such as surveillance, biomechanics or sports applications. This article presents BB6-HM, a block-based human model for real-time monitoring of a large number of visual events and states related to human activity analysis, which can be used as components of a library to describe more complex activities in such important areas as surveillance, for example, luggage at airports, clients’ behaviour in banks and patients in hospitals. BB6-HM is inspired by the proportionality rules commonly used in Visual Arts, i.e., for dividing the human silhouette into six rectangles of the same height. The major advantage of this proposal is that analysis of the human can be easily broken down into regions, so that we can obtain information of activities. The computational load is very low, so it is possible to define a very fast implementation. Finally, this model has been applied to build classifiers for the detection of primitive events and visual attributes using heuristic rules and machine learning techniques.Publicación A historical perspective of algorithmic lateral inhibition and accumulative computation in computer vision(Elsevier, 2011-03) Fernández Caballero, Antonio; Carmona, Enrique J.; Delgado, Ana Esperanza; López López, Carmen MaríaCertainly, one of the prominent ideas of Professor José Mira was that it is absolutely mandatory to specify the mechanisms and/or processes underlying each task and inference mentioned in an architecture in order to make operational that architecture. The conjecture of the last fifteen years of joint research has been that any bottom-up organization may be made operational using two biologically inspired methods called “algorithmic lateral inhibition”, a generalization of lateral inhibition anatomical circuits, and “accumulative computation”, a working memory related to the temporal evolution of the membrane potential. This paper is dedicated to the computational formulation of both methods. Finally, all of the works of our group related to this methodological approximation are mentioned and summarized, showing that all of them support the validity of this approximation.Publicación A historical perspective of algorithmic lateral inhibition and accumulative computation in computer vision(Elsevier, 2011-03) Fernández Caballero, Antonio; Carmona, Enrique J.; Delgado, Ana Esperanza; López López, María DoloresCertainly, one of the prominent ideas of Professor José Mira was that it is absolutely mandatory to specify the mechanisms and/or processes underlying each task and inference mentioned in an architecture in order to make operational that architecture. The conjecture of the last fifteen years of joint research has been that any bottom-up organization may be made operational using two biologically inspired methods called “algorithmic lateral inhibition”, a generalization of lateral inhibition anatomical circuits, and “accumulative computation”, a working memory related to the temporal evolution of the membrane potential. This paper is dedicated to the computational formulation of both methods. Finally, all of the works of our group related to this methodological approximation are mentioned and summarized, showing that all of them support the validity of this approximation.Publicación A new video segmentation method of moving objects based on blob-level knowledge(Elsevier, 2008-02-01) Carmona, Enrique J.; Martínez Campos, Javier; Mira Mira, JoséVariants of the background subtraction method are broadly used for the detection of moving objects in video sequences in different applications. In this work we propose a new approach to the background subtraction method which operates in the colour space and manages the colour information in the segmentation process to detect and eliminate noise. This new method is combined with blob-level knowledge associated with different types of blobs that may appear in the foreground. The idea is to process each pixel differently according to the category to which it belongs: real moving objects, shadows, ghosts, reflections, fluctuation or background noise. Thus, the foreground resulting from processing each image frame is refined selectively, applying at each instant the appropriate operator according to the type of noise blob we wish to eliminate. The approach proposed is adaptive, because it allows both the background model and threshold model to be updated. On the one hand, the results obtained confirm the robustness of the method proposed in a wide range of different sequences and, on the other hand, these results underline the importance of handling three colour components in the segmentation process rather than just the one grey-level component.Publicación A new video segmentation method of moving objects based on blob-level knowledge(Elsevier, 2008-02-01) Carmona, Enrique J.; Martínez Campos, Javier; Mira Mira, JoséVariants of the background subtraction method are broadly used for the detection of moving objects in video sequences in different applications. In this work we propose a new approach to the background subtraction method which operates in the colour space and manages the colour information in the segmentation process to detect and eliminate noise. This new method is combined with blob-level knowledge associated with different types of blobs that may appear in the foreground. The idea is to process each pixel differently according to the category to which it belongs: real moving objects, shadows, ghosts, reflections, fluctuation or background noise. Thus, the foreground resulting from processing each image frame is refined selectively, applying at each instant the appropriate operator according to the type of noise blob we wish to eliminate. The approach proposed is adaptive, because it allows both the background model and threshold model to be updated. On the one hand, the results obtained confirm the robustness of the method proposed in a wide range of different sequences and, on the other hand, these results underline the importance of handling three colour components in the segmentation process rather than just the one grey-level component.Publicación A Survey of Video Datasets for Human Action and Activity Recognition(Elsevier, 2013-06) Chaquet, José M.; Carmona, Enrique J.; Fernández Caballero, AntonioVision-based human action and activity recognition has an increasing importance among the computer vision community with applications to visual surveillance, video retrieval and human–computer interaction. In recent years, more and more datasets dedicated to human action and activity recognition have been created. The use of these datasets allows us to compare different recognition systems with the same input data. The survey introduced in this paper tries to cover the lack of a complete description of the most important public datasets for video-based human activity and action recognition and to guide researchers in the election of the most suitable dataset for benchmarking their algorithms.Publicación A Survey of Video Datasets for Human Action and Activity Recognition(Elsevier, 2013-06) Chaquet, José M.; Carmona, Enrique J.; Fernández Caballero, AntonioVision-based human action and activity recognition has an increasing importance among the computer vision community with applications to visual surveillance, video retrieval and human–computer interaction. In recent years, more and more datasets dedicated to human action and activity recognition have been created. The use of these datasets allows us to compare different recognition systems with the same input data. The survey introduced in this paper tries to cover the lack of a complete description of the most important public datasets for video-based human activity and action recognition and to guide researchers in the election of the most suitable dataset for benchmarking their algorithms.Publicación Automatic design of analog electronic circuits using grammatical evolution(Elsevier, 2018-01) Castejón, Federico; Carmona, Enrique J.A new approach for automatic synthesis of analog electronic circuits based on grammatical evolution is presented. Grammatical evolution is an evolutionary algorithm based on grammar which can generate code in any programming language and uses variable length linear binary strings. The decoding of each chromosome determines which production rules in a Backus-Naur Form grammar definition are used in a genotype-to-phenotype mapping process. In our method, decoding focuses on obtaining circuit netlists. A new grammar for generating such netlists and a variant of the XOSites-based crossover operator are also presented. A post-processing stage is needed to adapt the decoded netlist prior its evaluation using the NGSpice simulator. Our approach was applied to several case studies, comprising a total of seven benchmark circuits. A comparison with previous works in the literature shows that our method produces competitive circuits in relation to the degree of compliance with the output specifications, the number of components and the number of evaluations used in the evolutionary process.Publicación Automatic design of analog electronic circuits using grammatical evolution(Elsevier, 2018-01) Castejón, Federico; Carmona, Enrique J.A new approach for automatic synthesis of analog electronic circuits based on grammatical evolution is presented. Grammatical evolution is an evolutionary algorithm based on grammar which can generate code in any programming language and uses variable length linear binary strings. The decoding of each chromosome determines which production rules in a Backus-Naur Form grammar definition are used in a genotype-to-phenotype mapping process. In our method, decoding focuses on obtaining circuit netlists. A new grammar for generating such netlists and a variant of the XOSites-based crossover operator are also presented. A post-processing stage is needed to adapt the decoded netlist prior its evaluation using the NGSpice simulator. Our approach was applied to several case studies, comprising a total of seven benchmark circuits. A comparison with previous works in the literature shows that our method produces competitive circuits in relation to the degree of compliance with the output specifications, the number of components and the number of evaluations used in the evolutionary process.Publicación Deformable registration of multimodal retinal images using a weakly supervised deep learning approach(Springer, 2023-03-28) Martínez Río, Javier; Carmona, Enrique J.; Cancelas, Daniel; Novo, Jorge; Ortega, MarcosThere are different retinal vascular imaging modalities widely used in clinical practice to diagnose different retinal pathologies. The joint analysis of these multimodal images is of increasing interest since each of them provides common and complementary visual information. However, if we want to facilitate the comparison of two images, obtained with different techniques and containing the same retinal region of interest, it will be necessary to make a previous registration of both images. Here, we present a weakly supervised deep learning methodology for robust deformable registration of multimodal retinal images, which is applied to implement a method for the registration of fluorescein angiography (FA) and optical coherence tomography angiography (OCTA) images. This methodology is strongly inspired by VoxelMorph, a general unsupervised deep learning framework of the state of the art for deformable registration of unimodal medical images. The method was evaluated in a public dataset with 172 pairs of FA and superficial plexus OCTA images. The degree of alignment of the common information (blood vessels) and preservation of the non-common information (image background) in the transformed image were measured using the Dice coefficient (DC) and zero-normalized cross-correlation (ZNCC), respectively. The average values of the mentioned metrics, including the standard deviations, were DC = 0.72 ± 0.10 and ZNCC = 0.82 ± 0.04. The time required to obtain each pair of registered images was 0.12 s. These results outperform rigid and deformable registration methods with which our method was compared.Publicación Deformable registration of multimodal retinal images using a weakly supervised deep learning approach(Springer, 2023-03-28) Martínez Río, Javier; Carmona, Enrique J.; Cancelas, Daniel; Novo, Jorge; Ortega, MarcosThere are different retinal vascular imaging modalities widely used in clinical practice to diagnose different retinal pathologies. The joint analysis of these multimodal images is of increasing interest since each of them provides common and complementary visual information. However, if we want to facilitate the comparison of two images, obtained with different techniques and containing the same retinal region of interest, it will be necessary to make a previous registration of both images. Here, we present a weakly supervised deep learning methodology for robust deformable registration of multimodal retinal images, which is applied to implement a method for the registration of fluorescein angiography (FA) and optical coherence tomography angiography (OCTA) images. This methodology is strongly inspired by VoxelMorph, a general unsupervised deep learning framework of the state of the art for deformable registration of unimodal medical images. The method was evaluated in a public dataset with 172 pairs of FA and superficial plexus OCTA images. The degree of alignment of the common information (blood vessels) and preservation of the non-common information (image background) in the transformed image were measured using the Dice coefficient (DC) and zero-normalized cross-correlation (ZNCC), respectively. The average values of the mentioned metrics, including the standard deviations, were DC = 0.72 ± 0.10 and ZNCC = 0.82 ± 0.04. The time required to obtain each pair of registered images was 0.12 s. These results outperform rigid and deformable registration methods with which our method was compared.Publicación Fast detection of the main anatomical structures in digital retinal images based on intra-and inter-structure relational knowledge(Elsevier, 2017-10) Molina Casado, José María; Carmona, Enrique J.; García Feijoó, JuliánBackground and objective: The anatomical structure detection in retinal images is an open problem. However, most of the works in the related literature are oriented to the detection of each structure individually or assume the previous detection of a structure which is used as a reference. The objective of this paper is to obtain simultaneous detection of the main retinal structures (optic disc, macula, network of vessels and vascular bundle) in a fast and robust way. Methods: We propose a new methodology oriented to accomplish the mentioned objective. It consists of two stages. In an initial stage, a set of operators is applied to the retinal image. Each operator uses intra-structure relational knowledge in order to produce a set of candidate blobs that belongs to the desired structure. In a second stage, a set of tuples is created, each of which contains a different combination of the candidate blobs. Next, filtering operators, using inter-structure relational knowledge, are used in order to find the winner tuple. A method using template matching and mathematical morphology is implemented following the proposed methodology. Results: A success is achieved if the distance between the automatically detected blob center and the actual structure center is less than or equal to one optic disc radius. The success rates obtained in the different public databases analyzed were: MESSIDOR (99.33%, 98.58%, 97.92%), DIARETDB1 (96.63%, 100%, 97.75%), DRIONS (100%, n/a, 100%) and ONHSD (100%, 98.85%, 97.70%) for optic disc (OD), macula (M) and vascular bundle (VB), respectively. Finally, the overall success rate obtained in this study for each structure was: 99.26% (OD), 98.69% (M) and 98.95% (VB). The average time of processing per image was 4.16 ± 0.72 s. Conclusions: The main advantage of the use of inter-structure relational knowledge was the reduction of the number of false positives in the detection process. The implemented method is able to simultaneously detect four structures. It is fast, robust and its detection results are competitive in relation to other methods of the recent literature.Publicación Fast detection of the main anatomical structures in digital retinal images based on intra-and inter-structure relational knowledge(Elsevier, 2017-10) Molina Casado, José María; Carmona, Enrique J.; García Feijoó, JuliánBackground and objective: The anatomical structure detection in retinal images is an open problem. However, most of the works in the related literature are oriented to the detection of each structure individually or assume the previous detection of a structure which is used as a reference. The objective of this paper is to obtain simultaneous detection of the main retinal structures (optic disc, macula, network of vessels and vascular bundle) in a fast and robust way. Methods: We propose a new methodology oriented to accomplish the mentioned objective. It consists of two stages. In an initial stage, a set of operators is applied to the retinal image. Each operator uses intra-structure relational knowledge in order to produce a set of candidate blobs that belongs to the desired structure. In a second stage, a set of tuples is created, each of which contains a different combination of the candidate blobs. Next, filtering operators, using inter-structure relational knowledge, are used in order to find the winner tuple. A method using template matching and mathematical morphology is implemented following the proposed methodology. Results: A success is achieved if the distance between the automatically detected blob center and the actual structure center is less than or equal to one optic disc radius. The success rates obtained in the different public databases analyzed were: MESSIDOR (99.33%, 98.58%, 97.92%), DIARETDB1 (96.63%, 100%, 97.75%), DRIONS (100%, n/a, 100%) and ONHSD (100%, 98.85%, 97.70%) for optic disc (OD), macula (M) and vascular bundle (VB), respectively. Finally, the overall success rate obtained in this study for each structure was: 99.26% (OD), 98.69% (M) and 98.95% (VB). The average time of processing per image was 4.16 ± 0.72 s. Conclusions: The main advantage of the use of inter-structure relational knowledge was the reduction of the number of false positives in the detection process. The implemented method is able to simultaneously detect four structures. It is fast, robust and its detection results are competitive in relation to other methods of the recent literature.Publicación Identification of the optic nerve head with genetic algorithms(Elsevier, 2008-07) Carmona, Enrique J.; García Feijoó, Julián; Martínez de la Casa, José M.; Rincón Zamorano, MarianoObjective This work proposes creating an automatic system to locate and segment the optic nerve head (ONH) in eye fundus photographic images using genetic algorithms. Methods and material Domain knowledge is used to create a set of heuristics that guide the various steps involved in the process. Initially, using an eye fundus colour image as input, a set of hypothesis points was obtained that exhibited geometric properties and intensity levels similar to the ONH contour pixels. Next, a genetic algorithm was used to find an ellipse containing the maximum number of hypothesis points in an offset of its perimeter, considering some constraints. The ellipse thus obtained is the approximation to the ONH. The segmentation method is tested in a sample of 110 eye fundus images, belonging to 55 patients with glaucoma (23.1%) and eye hypertension (76.9%) and random selected from an eye fundus image base belonging to the Ophthalmology Service at Miguel Servet Hospital, Saragossa (Spain). Results and conclusions The results obtained are competitive with those in the literature. The method's generalization capability is reinforced when it is applied to a different image base from the one used in our study and a discrepancy curve is obtained very similar to the one obtained in our image base. In addition, the robustness of the method proposed can be seen in the high percentage of images obtained with a discrepancy δ < 5 (96% and 99% in our and a different image base, respectively). The results also confirm the hypothesis that the ONH contour can be properly approached with a non-deformable ellipse. Another important aspect of the method is that it directly provides the parameters characterising the shape of the papilla: lengths of its major and minor axes, its centre of location and its orientation with regard to the horizontal position.Publicación Identification of the optic nerve head with genetic algorithms(Elsevier, 2008-07) Carmona, Enrique J.; García Feijoó, Julián; Martínez de la Casa, José M.; Rincón Zamorano, MarianoObjective This work proposes creating an automatic system to locate and segment the optic nerve head (ONH) in eye fundus photographic images using genetic algorithms. Methods and material Domain knowledge is used to create a set of heuristics that guide the various steps involved in the process. Initially, using an eye fundus colour image as input, a set of hypothesis points was obtained that exhibited geometric properties and intensity levels similar to the ONH contour pixels. Next, a genetic algorithm was used to find an ellipse containing the maximum number of hypothesis points in an offset of its perimeter, considering some constraints. The ellipse thus obtained is the approximation to the ONH. The segmentation method is tested in a sample of 110 eye fundus images, belonging to 55 patients with glaucoma (23.1%) and eye hypertension (76.9%) and random selected from an eye fundus image base belonging to the Ophthalmology Service at Miguel Servet Hospital, Saragossa (Spain). Results and conclusions The results obtained are competitive with those in the literature. The method's generalization capability is reinforced when it is applied to a different image base from the one used in our study and a discrepancy curve is obtained very similar to the one obtained in our image base. In addition, the robustness of the method proposed can be seen in the high percentage of images obtained with a discrepancy δ < 5 (96% and 99% in our and a different image base, respectively). The results also confirm the hypothesis that the ONH contour can be properly approached with a non-deformable ellipse. Another important aspect of the method is that it directly provides the parameters characterising the shape of the papilla: lengths of its major and minor axes, its centre of location and its orientation with regard to the horizontal position.Publicación Introducing modularity and homology in grammatical evolution to address the analog electronic circuit design problem(IEEE, 2020-08-24) Castejón, Federico; Carmona, Enrique J.We present a new approach based on grammatical evolution (GE) aimed at addressing the analog electronic circuit design problem. In the new approach, called multi-grammatical evolution (MGE), a chromosome is a variable-length codon string that is divided into as many partitions as subproblems result from breaking down the original optimization problem: circuit topology and component sizing in our case. This leads to a modular approach where the solution of each subproblem is encoded and evolved in a partition of the chromosome. Additionally, each partition is decoded according to a specific grammar and the final solution to the original problem emerges as an aggregation result associated with the decoding process of the different partitions. Modularity facilitates the encoding and evolution of the solution in each subproblem. On the other way, homology helps to reduce the potentially destructive effect associated with standard crossover operators normally used in GE-based approaches. Seven analog circuit designs are addressed by an MGE-based method and the obtained results are compared to those obtained by different methods based on GE and other evolutionary paradigms. A simple parsimony mechanism was also introduced to ensure compliance with design specifications and reduce the number of components of the circuits obtained. We can conclude that our method obtains competitive results in the seven circuits analyzed.Publicación Introducing modularity and homology in grammatical evolution to address the analog electronic circuit design problem(IEEE, 2020-08-24) Castejón, Federico; Carmona, Enrique J.We present a new approach based on grammatical evolution (GE) aimed at addressing the analog electronic circuit design problem. In the new approach, called multi-grammatical evolution (MGE), a chromosome is a variable-length codon string that is divided into as many partitions as subproblems result from breaking down the original optimization problem: circuit topology and component sizing in our case. This leads to a modular approach where the solution of each subproblem is encoded and evolved in a partition of the chromosome. Additionally, each partition is decoded according to a specific grammar and the final solution to the original problem emerges as an aggregation result associated with the decoding process of the different partitions. Modularity facilitates the encoding and evolution of the solution in each subproblem. On the other way, homology helps to reduce the potentially destructive effect associated with standard crossover operators normally used in GE-based approaches. Seven analog circuit designs are addressed by an MGE-based method and the obtained results are compared to those obtained by different methods based on GE and other evolutionary paradigms. A simple parsimony mechanism was also introduced to ensure compliance with design specifications and reduce the number of components of the circuits obtained. We can conclude that our method obtains competitive results in the seven circuits analyzed.Publicación Modeling, localization, and segmentation of the foveal avascular zone on retinal OCT-angiography images(IEEE, 2020-08-17) Carmona, Enrique J.; Díaz González, Macarena; Novo, Jorge; Ortega, MarcosThe Foveal Avascular Zone (FAZ) is a capillary-free area that is placed inside the macula and its morphology and size represent important biomarkers to detect different ocular pathologies such as diabetic retinopathy, impaired vision or retinal vein occlusion. Therefore, an adequate and precise segmentation of the FAZ presents a high clinical interest. About to this, Angiography by Optical Coherence Tomography (OCT-A) is a non-invasive imaging technique that allows the expert to visualize the vascular and avascular foveal zone. In this work, we present a robust methodology composed of three stages to model, localize, and segment the FAZ in OCT-A images. The first stage is addressed to generate two FAZ normality models: superficial and deep plexus. The second one uses the FAZ model as a template to localize the FAZ center. Finally, in the third stage, an adaptive binarization is proposed to segment the entire FAZ region. A method based on this methodology was implemented and validated in two OCT-A image subsets, presenting the second subset more challenging pathological conditions than the first. We obtained localization success rates of 100% and 96% in the first and second subsets, respectively, considering a success if the obtained FAZ center is inside the FAZ area segmented by an expert clinician. Complementary, the Dice score and other indexes (Jaccard index and Hausdorff distance) are used to measure the segmentation quality, obtaining competitive average values in the first subset: 0.84 ± 0.01 (expert 1) and 0.85 ± 0.01 (expert 2). The average Dice score obtained in the second subset was also acceptable (0.70 ± 0.17), even though the segmentation process is more complex in this case.Publicación Modeling, localization, and segmentation of the foveal avascular zone on retinal OCT-angiography images(IEEE, 2020-08-17) Carmona, Enrique J.; Díaz, Macarena; Novo, Jorge; Ortega, MarcosThe Foveal Avascular Zone (FAZ) is a capillary-free area that is placed inside the macula and its morphology and size represent important biomarkers to detect different ocular pathologies such as diabetic retinopathy, impaired vision or retinal vein occlusion. Therefore, an adequate and precise segmentation of the FAZ presents a high clinical interest. About to this, Angiography by Optical Coherence Tomography (OCT-A) is a non-invasive imaging technique that allows the expert to visualize the vascular and avascular foveal zone. In this work, we present a robust methodology composed of three stages to model, localize, and segment the FAZ in OCT-A images. The first stage is addressed to generate two FAZ normality models: superficial and deep plexus. The second one uses the FAZ model as a template to localize the FAZ center. Finally, in the third stage, an adaptive binarization is proposed to segment the entire FAZ region. A method based on this methodology was implemented and validated in two OCT-A image subsets, presenting the second subset more challenging pathological conditions than the first. We obtained localization success rates of 100% and 96% in the first and second subsets, respectively, considering a success if the obtained FAZ center is inside the FAZ area segmented by an expert clinician. Complementary, the Dice score and other indexes (Jaccard index and Hausdorff distance) are used to measure the segmentation quality, obtaining competitive average values in the first subset: 0.84 ± 0.01 (expert 1) and 0.85 ± 0.01 (expert 2). The average Dice score obtained in the second subset was also acceptable (0.70 ± 0.17), even though the segmentation process is more complex in this case.