TY - GEN
T1 - Parallelizing ultrasound image processing using OpenMP on multicore embedded systems
AU - Huang, Lei
AU - Stotzer, Eric
AU - Yi, Hangjun
AU - Chapman, Barbara
AU - Chandrasekaran, Sunita
PY - 2012
Y1 - 2012
N2 - The shift towards multicore architectures poses significant challenges to the programmers. Unlike programming on single core architectures, multicore architectures require the programmer to decide on how the work needs to be distributed across multiple processors. In this contribution, we analyze the needs of a high-level programming model to program multicore architectures. We use OpenMP as the high-level programming model to increase programmer productivity, reduce time to market and development/design costs for these systems. In this work, we have explored the medical ultrasound application using OpenMP on a TI-based Tomahawk platform that is a six-core, high performance multicore DSP system. This application is heavily based on image processing and the goal is to achieve desired level of image quality. We have explored the different cache configurations of the system. In this process, we were able to study the performance impacts of data locality when data objects are placed into different components of the Tomahawk memory system.
AB - The shift towards multicore architectures poses significant challenges to the programmers. Unlike programming on single core architectures, multicore architectures require the programmer to decide on how the work needs to be distributed across multiple processors. In this contribution, we analyze the needs of a high-level programming model to program multicore architectures. We use OpenMP as the high-level programming model to increase programmer productivity, reduce time to market and development/design costs for these systems. In this work, we have explored the medical ultrasound application using OpenMP on a TI-based Tomahawk platform that is a six-core, high performance multicore DSP system. This application is heavily based on image processing and the goal is to achieve desired level of image quality. We have explored the different cache configurations of the system. In this process, we were able to study the performance impacts of data locality when data objects are placed into different components of the Tomahawk memory system.
UR - https://www.scopus.com/pages/publications/84876503456
U2 - 10.1109/GHTCE.2012.6490139
DO - 10.1109/GHTCE.2012.6490139
M3 - Conference contribution
AN - SCOPUS:84876503456
SN - 9781467350853
T3 - 2012 IEEE Global High Tech Congress on Electronics, GHTCE 2012
SP - 131
EP - 138
BT - 2012 IEEE Global High Tech Congress on Electronics, GHTCE 2012
T2 - 2012 IEEE Global High Tech Congress on Electronics, GHTCE 2012
Y2 - 18 November 2012 through 20 November 2012
ER -