Page 118 - Computational Retinal Image Analysis
P. 118

5  Summary and outlook    111




                     Moreover, Guimaraes et al. [147] report segmentation and reconstruction of 3D
                     retinal vasculature. With the further advancement of 3D imaging techniques,
                     this exciting direction of 3D vessel analysis possesses excellent research
                     potential. It is also well connected to existing 3D blood vessel analysis efforts of
                     other organs and via different instruments such as magnetic resonance imaging.
                     Multimodal vessel analysis. One prominent example of multimodal retinal
                     image analysis could be the combined usage of 2D fundus and 3D OCT
                     images in vessel segmentation and tracing. Here, Hu et al. [148] investigate
                     the integration of 2D fundus image and corresponding 3D OCT images in
                     retinal vessel segmentation. SLO fundus image and corresponding macular
                     OCT slices are jointly considered in Ref. [149] in two steps. The first step
                     involves 2D vessel segmentation of fundus images in curvelet domain, with
                     the side information from multiple OCT slices. The second step focuses on 3D
                     reconstruction of the blood vessels from the OCT data. In addition to images,
                     patient-related information such as electronic medical records and genetic
                     data could also be taken into account for better disease diagnosis, and patient
                     prognosis and treatment.
                     Analyzing mobile retinal images. As stated in Ref. [150], the recent development
                     of portable fundus cameras [151] and smartphone-based fundus imaging
                     systems have led to considerable opportunities as well as new challenges to
                     retinal image analysis, such as the demand for more affordable imaging devices
                     with perhaps lower computation cost [152]. To address the need of the emerging
                     mobile retinal diagnostic devices that calls for segmentation techniques with
                     low memory and computation cost, Hajabdollahi et al. [152] propose to train
                     a simple CNN model-based pruning and quantization of the original full-
                     sized model, that is capable of retaining the performance with much reduced
                     computation and size. Looking forward, we expect further and considerable
                     progresses along this direction.



                  5.2  Benchmarks and metrics
                  Benchmark datasets, such as ImageNet [39] and COCO [40], have played a vital role
                  in fueling the recent computer vision breakthroughs. It has been observed that these
                  successful datasets are both of large scale, and richly annotated. For example, over
                  the years, COCO has introduced and aggregated various annotations on categories
                  and shapes of individual objects and stuffs (backgrounds). Currently, it is capable
                  of hosting a broad variety of closely related tasks including instance segmentation,
                  stuff segmentation, object detection, person keypoint localization, as well as image
                  captioning, visual dialog, image attributes, text detection and recognition, among
                  others. By contrast, existing retinal vasculature image datasets, as shown in Table 1,
                  are of small size, and often not richly annotated. While most state-of-the-art results
                  are still reported based on the well-known benchmarks of DRIVE and STARE, the
                  low-resolution images considered in these datasets are considerably lagging behind
   113   114   115   116   117   118   119   120   121   122   123