• optimization of sub-query processing in distributed data integration systems

    جزئیات بیشتر مقاله
    • تاریخ ارائه: 1392/07/24
    • تاریخ انتشار در تی پی بین: 1392/07/24
    • تعداد بازدید: 1101
    • تعداد پرسش و پاسخ ها: 0
    • شماره تماس دبیرخانه رویداد: -
     data integration system (dis) is becoming paramount when cloud/grid applications need to integrate and analyze data from geographically distributed data sources. dis gathers data from multiple remote sources, integrates and analyzes the data to obtain a query result. as clouds/grids are distributed over wide-area networks, communication cost usually dominates overall query response time. therefore we can expect that query performance can be improved by minimizing communication cost.in our method, dis uses a data flow style query execution model. each query plan is mapped to a group of μengines, each of which is a program corresponding to a particular operator. thus, multiple sub-queries from concurrent queries are able to share μengines. we reconstruct these sub-queries to exploit overlapping data among them. as a result, all the sub-queries can obtain their results, and overall communication overhead can be reduced. experimental results show that, when dis runs a group of parameterized queries, our reconstructing algorithm can reduce the average query completion time by 32–48%; when dis runs a group of non-parameterized queries, the average query completion time of queries can be reduced by 25–35%.

سوال خود را در مورد این مقاله مطرح نمایید :

با انتخاب دکمه ثبت پرسش، موافقت خود را با قوانین انتشار محتوا در وبسایت تی پی بین اعلام می کنم
مقالات جدیدترین ژورنال ها