dc.contributor |
Barcelona Supercomputing Center |
dc.contributor.author |
Fischer e Silva, Renan |
dc.contributor.author |
Carpenter, Paul M. |
dc.date |
2017-09-26 |
dc.identifier.citation |
Fischer e Silva, R.; Carpenter, P. M. High Throughput and Low Latency on Hadoop Clusters Using Explicit Congestion Notification: The Untold Truth. A: "2017 IEEE International Conference on Cluster Computing (CLUSTER)". IEEE, 2017, p. 349-353. |
dc.identifier.citation |
978-1-5386-2326-8 |
dc.identifier.citation |
10.1109/CLUSTER.2017.19 |
dc.identifier.uri |
http://hdl.handle.net/2117/108637 |
dc.language.iso |
eng |
dc.publisher |
IEEE |
dc.relation |
http://ieeexplore.ieee.org/abstract/document/8048946/ |
dc.relation |
info:eu-repo/grantAgreement/ES/1PE/TIN2015-65316-P |
dc.rights |
Attribution-NonCommercial-NoDerivs 3.0 Spain |
dc.rights |
info:eu-repo/semantics/openAccess |
dc.rights |
http://creativecommons.org/licenses/by-nc-nd/3.0/es/ |
dc.subject |
Àrees temàtiques de la UPC::Enginyeria electrònica |
dc.subject |
Cluster analysis--Data processing |
dc.subject |
Throughput |
dc.subject |
Proposals |
dc.subject |
Ports (Computers) |
dc.subject |
Runtime |
dc.subject |
Robustness |
dc.subject |
Receivers |
dc.subject |
Hadoop |
dc.subject |
ECN |
dc.subject |
DCTCP |
dc.subject |
Throughput |
dc.subject |
Latency |
dc.subject |
Computació distribuïda |
dc.subject |
Supercomputadors |
dc.title |
High Throughput and Low Latency on Hadoop Clusters Using Explicit Congestion Notification: The Untold Truth |
dc.type |
info:eu-repo/semantics/submittedVersion |
dc.type |
info:eu-repo/semantics/conferenceObject |
dc.description.abstract |
Various extensions of TCP/IP have been proposed to reduce network latency; examples include Explicit Congestion Notification (ECN), Data Center TCP (DCTCP) and several proposals for Active Queue Management (AQM). Combining these techniques requires adjusting various parameters, and recent studies have found that it is difficult to do so while obtaining both high performance and low latency. This is especially true for mixed use data centres that host both latency-sensitive applications and high-throughput workloads such as Hadoop.This paper studies the difficulty in configuration, and characterises the problem as related to ACK packets. Such packets cannot be set as ECN Capable Transport (ECT), with the consequence that a disproportionate number of them are dropped. We explain how this behavior decreases throughput, and propose a small change to the way that non-ECT-capable packets are handled in the network switches. We demonstrate robust performance for modified AQMs on a Hadoop cluster, maintaining full throughput while reducing latency by 85%. We also demonstrate that commodity switches with shallow buffers are able to reach the same throughput as deeper buffer switches. Finally, we explain how both TCP-ECN and DCTCP can achieve the best performance using a simple marking scheme, in constrast to the current preference for relying on AQMs to mark packets. |
dc.description.abstract |
The research leading to these results has received funding from the European Unions Seventh Framework Programme (FP7/2007–2013) under grant agreement number 610456 (Euroserver).
The research was also supported by the Ministry of Economy and Competitiveness of Spain under the contracts TIN2012-34557 and TIN2015-65316-P, Generalitat de Catalunya (contracts 2014-SGR-1051 and 2014-SGR-1272), HiPEAC-3 Network of Excellence (ICT- 287759), and the Severo Ochoa Program (SEV-2011-00067) of the Spanish
Government. |
dc.description.abstract |
Peer Reviewed |