[Computer Science Literature Translation] Network Performance Measurement

Source text: Computer Networks, 4th edition, by Andrew S. Tanenbaum.
Network Performance Measurement
When a network performs poorly, its users often complain to the folks running it, demanding improvements. To improve the performance, the operators must first determine exactly what is going on. To find out what is really happening, the operators must make measurements. In this section we will look at network performance measurements. The discussion below is based on the work of Mogul (1993).
The basic loop used to improve network performance contains the following steps:
1. Measure the relevant network parameters and performance.
2. Try to understand what is going on.
3. Change one parameter.
These steps are repeated until the performance is good enough or it is clear that the last drop of improvement has been squeezed out.
Measurements can be made in many ways and at many locations (both physically and in the protocol stack). The most basic kind of measurement is to start a timer when beginning some activity and see how long that activity takes. For example, knowing how long it takes for a TPDU to be acknowledged is a key measurement. Other measurements are made with counters that record how often some event has happened (e.g., number of lost TPDUs). Finally, one is often interested in knowing the amount of something, such as the number of bytes processed in a certain time interval.
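To make this concrete, here is a minimal sketch of both kinds of measurement: a timer around one activity and a counter of loss events. It is not from the original text; the echo server address and the timeout value are assumptions.

```python
import socket
import time

# Minimal sketch: time one request/response round trip and count losses.
SERVER = ("127.0.0.1", 7)   # hypothetical echo service; assumed to be running
TIMEOUT = 1.0               # seconds to wait before declaring the reply lost

sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
sock.settimeout(TIMEOUT)

lost = 0                     # counter: how often the loss event happened
start = time.perf_counter()  # start a timer when beginning the activity
sock.sendto(b"probe", SERVER)
try:
    sock.recvfrom(1024)      # wait for the acknowledgement/echo
    elapsed = time.perf_counter() - start
    print(f"round trip took {elapsed * 1e6:.0f} microseconds")
except socket.timeout:
    lost += 1                # record the lost-reply event
    print(f"no reply within {TIMEOUT}s; losses so far: {lost}")
```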
Measuring network performance and parameters has many potential pitfalls. Below we list a few of them. Any systematic attempt to measure network performance should be careful to avoid these.
Make Sure That the Sample Size Is Large Enough
Do not measure the time to send one TPDU, but repeat the measurement, say, one million times and take the average. Having a large sample will reduce the uncertainty in the measured mean and standard deviation. This uncertainty can be computed using standard statistical formulas.
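The "standard statistical formulas" here are the sample mean, the sample standard deviation, and the standard error of the mean, which shrinks as the square root of the sample size. A minimal sketch, with a sleep standing in for the operation being timed:

```python
import math
import time

def timed_operation():
    # Stand-in for the activity being measured (e.g., sending one TPDU).
    time.sleep(0.0003)

n = 1000                      # in practice, use a much larger sample
samples = []
for _ in range(n):
    start = time.perf_counter()
    timed_operation()
    samples.append(time.perf_counter() - start)

mean = sum(samples) / n
# Sample standard deviation (n - 1 in the denominator).
std = math.sqrt(sum((x - mean) ** 2 for x in samples) / (n - 1))
# The uncertainty in the mean shrinks with the square root of the sample size.
std_error = std / math.sqrt(n)
print(f"mean = {mean * 1e6:.1f} us +/- {std_error * 1e6:.2f} us")
```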
Make Sure That the Samples Are Representative
Ideally, the whole sequence of one million measurements should be repeated at different times of the day and the week to see the effect of different system loads on the measured quantity. Measurements of congestion, for example, are of little use if they are made at a moment when there is no congestion. Sometimes the results may be counterintuitive at first, such as heavy congestion at 10, 11, 1, and 2 o'clock, but no congestion at noon (when all the users are away at lunch).
Be Careful When Using a Coarse-Grained Clock
Computer clocks work by incrementing some counter at regular intervals. For example, a millisecond timer adds 1 to a counter every 1 msec. Using such a timer to measure an event that takes less than 1 msec is possible, but requires some care. (Some computers have more accurate clocks, of course.)
To measure the time to send a TPDU, for example, the system clock (say, in milliseconds) should be read out when the transport layer code is entered and again when it is exited. If the true TPDU send time is 300 µsec, the difference between the two readings will be either 0 or 1, both wrong. However, if the measurement is repeated one million times and the total of all the measurements is added up and divided by one million, the mean time will be accurate to better than 1 µsec.
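This averaging effect is easy to simulate: quantize a 300 µsec event with a millisecond clock at a random phase and average over many repetitions. The sketch below is an illustration of the claim, not code from the book:

```python
import random

TRUE_TIME_MS = 0.3        # the event really takes 300 microseconds
N = 1_000_000

total_ticks = 0
for _ in range(N):
    # The event starts at a random phase within the current clock tick.
    phase = random.random()
    # Reading a millisecond clock before and after the event yields the
    # number of tick boundaries crossed: 0 or 1 for a 300 microsecond event.
    total_ticks += int(phase + TRUE_TIME_MS)

mean_ms = total_ticks / N
print(f"every reading was 0 or 1 ms, but the mean is {mean_ms:.4f} ms")
# With a million samples the mean is accurate to well under a microsecond.
```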
Be Sure That Nothing Unexpected Is Going On during Your Tests
Making measurements on a university system the day some major lab project has to be turned in may give different results than if made the next day. Likewise, if some researcher has decided to run a video conference over your network during your tests, you may get a biased result. It is best to run tests on an idle system and create the entire workload yourself. Even this approach has pitfalls though. While you might think nobody will be using the network at 3 A.M., that might be precisely when the automatic backup program begins copying all the disks to tape. Furthermore, there might be heavy traffic for your wonderful World Wide Web pages from distant time zones.
Caching Can Wreak Havoc with Measurements
The obvious way to measure file transfer times is to open a large file, read the whole thing, close it, and see how long it takes. Then repeat the measurement many more times to get a good average. The trouble is, the system may cache the file, so only the first measurement actually involves network traffic. The rest are just reads from the local cache. The results from such a measurement are essentially worthless (unless you want to measure cache performance).
Often you can get around caching by simply overflowing the cache. For example, if the cache is 10 MB, the test loop could open, read, and close two 10-MB files on each pass, in an attempt to force the cache hit rate to 0. Still, caution is advised unless you are absolutely sure you understand the caching algorithm.
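A sketch of such a cache-defeating loop follows. The file names are hypothetical, and it assumes the two files sit on the networked file system under test and together exceed the 10-MB cache:

```python
import time

# Two files, each as large as the (assumed) 10 MB cache, read alternately
# so that each read should evict the other file and drive the hit rate to 0.
FILES = ["big_a.bin", "big_b.bin"]   # hypothetical 10 MB files on the remote FS
PASSES = 100

start = time.perf_counter()
for _ in range(PASSES):
    for name in FILES:
        with open(name, "rb") as f:  # open, read the whole thing, close
            f.read()
elapsed = time.perf_counter() - start
print(f"mean time per file: {elapsed / (PASSES * len(FILES)):.4f} s")
```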
Buffering can have a similar effect. One popular TCP/IP performance utility program has been known to report that UDP can achieve a performance substantially higher than the physical line allows. How does this occur? A call to UDP normally returns control as soon as the message has been accepted by the kernel and added to the transmission queue. If there is sufficient buffer space, timing 1000 UDP calls does not mean that all the data have been sent. Most of them may still be in the kernel, but the performance utility thinks they have all been transmitted.
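The pitfall is visible in a few lines: timing a loop of sendto() calls measures how fast the kernel accepts data into its transmission queue, not how fast the line carries it. A sketch with an assumed destination address:

```python
import socket
import time

DEST = ("192.0.2.1", 9)   # hypothetical sink; 192.0.2.0/24 is a test range
MSG = b"x" * 1024
N = 1000

sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
start = time.perf_counter()
for _ in range(N):
    # sendto() returns as soon as the kernel has queued the datagram;
    # much of the data may still be in kernel buffers when the loop ends.
    sock.sendto(MSG, DEST)
elapsed = time.perf_counter() - start

# This apparent "throughput" can exceed what the physical line allows.
print(f"apparent rate: {N * len(MSG) * 8 / elapsed / 1e6:.1f} Mbit/s")
```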
Understand What You Are Measuring
When you measure the time to read a remote file, your measurements depend on the network, the operating systems on both the client and server, the particular hardware interface boards used, their drivers, and other factors. If the measurements are done carefully, you will ultimately discover the file transfer time for the configuration you are using. If your goal is to tune this particular configuration, these measurements are fine.
However, if you are making similar measurements on three different systems in order to choose which network interface board to buy, your results could be thrown off completely by the fact that one of the network drivers is truly awful and is only getting 10 percent of the performance of the board.