Towards energy-performance trade-off analysis of parallel applications