Big data stream computing has been designed with little regard for energy consumption, so the high energy cost of its data processing urgently needs to be addressed. Building on our previous work, and subject to the performance constraints of big data stream computing platforms, this project focuses on three aspects: energy consumption mechanisms, energy models, and energy optimization methods for big data stream computing. First, starting from energy consumption models of memory, CPU, network bandwidth, disk, and other components, and by monitoring resource state while tasks run, we study the energy consumption mechanisms of the individual components and of their interactions in a big data stream computing environment. Second, building on these mechanisms, we establish three models: energy forecasting, energy monitoring, and energy settlement. With them we can forecast, by sampling, the energy consumption of a stream computing topology before it executes, monitor its energy consumption dynamically while it runs, and settle the energy it has consumed after it completes. Finally, energy optimization is intended to span the entire execution of a topology, from optimizing the energy consumed by individual components and the allocation of resources to optimizing the overall energy efficiency of stream processing. This research is expected to improve the overall energy efficiency of big data stream computing topologies and clusters, and to support key technologies for power and energy management in big data stream computing.
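Because big data stream computing gives little consideration to energy consumption when processing data, the high energy consumption of its data processing urgently needs to be solved. Taking into account the performance constraints of big data stream computing platforms and our previous work, the project studies three aspects of stream processing: energy consumption mechanisms, energy consumption models, and energy optimization. First, on the basis of energy consumption models for memory, CPU, network bandwidth, disk, and other components, and in combination with the allocation and monitoring of task resources, it studies the energy consumption mechanisms of each component and between components in a big data stream computing environment. Second, building on the study of these mechanisms, it establishes three models, for energy forecasting, energy monitoring, and energy settlement, which provide sampling-based forecasting of energy consumption before a stream computing topology task starts, monitoring of energy consumption during execution, and settlement of energy consumption after execution. Finally, energy optimization is intended to run through the entire execution of big data stream computing, from optimizing the energy consumed by the individual components and the allocation of resources to optimizing the overall energy efficiency of stream processing. The results are expected to improve the overall energy efficiency of big data stream processing topologies and clusters, and to support key technologies for energy management in big data stream processing.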
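The abstract does not specify the form of the component energy models. As a rough illustration only, the sketch below assumes a linear utilization-based power model for each component (idle power plus a utilization-proportional term) and sums sampled power over fixed intervals to estimate the energy of one node. The class name, function names, and wattage figures are hypothetical and are not taken from the project.

```python
from dataclasses import dataclass
from typing import Dict, Sequence


@dataclass(frozen=True)
class ComponentPower:
    """Assumed linear power model for one component (CPU, memory, disk, ...)."""
    idle_watts: float
    peak_watts: float

    def power(self, utilization: float) -> float:
        # Clamp utilization to [0, 1], then interpolate between idle and peak.
        u = min(max(utilization, 0.0), 1.0)
        return self.idle_watts + (self.peak_watts - self.idle_watts) * u


def energy_joules(model: ComponentPower,
                  samples: Sequence[float],
                  interval_s: float) -> float:
    """Energy of one component: sum of sampled power times the sampling interval."""
    return sum(model.power(u) for u in samples) * interval_s


def node_energy_joules(models: Dict[str, ComponentPower],
                       traces: Dict[str, Sequence[float]],
                       interval_s: float) -> float:
    """Energy of a node: sum over its components (interactions are ignored here)."""
    return sum(energy_joules(models[name], traces[name], interval_s)
               for name in models)


# Hypothetical wattages and utilization traces, sampled every 2 seconds.
models = {"cpu": ComponentPower(50.0, 150.0), "memory": ComponentPower(5.0, 15.0)}
traces = {"cpu": [0.0, 0.5, 1.0], "memory": [0.2, 0.2, 0.2]}
total = node_energy_joules(models, traces, interval_s=2.0)
```

Under these assumptions, the same function could serve all three stages described above: applied to predicted utilization it gives a forecast, applied to live samples it gives a running monitor, and applied to the full recorded trace it gives the settled total. The project's actual models may differ, in particular in how they treat interactions between components.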
Data last updated: 2023-05-31
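Research on Network Scheduling Optimization Theory and Algorithms for Big Data Stream Computing
Research on Network Performance Optimization for Streaming Big Data Processing
Research on Streaming Artificial Intelligence Computing Models Based on Heterogeneous Big Data
Research on Availability Problems and Guarantee Methods for Big Data Stream Computing under Dynamic Load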