SPEC OSG SPECmail2009 Benchmark
Workload Characterization for SPECmail_Ent2009 Metric
Mike Abbott, Yun-seng Chao
December 2008
This document summarizes the studies on mail server workload collected from multiple university and corporate sources, using a variety of IMAP4 clients. The analyzed workloads consist of both SMTP and IMAP4 requests. Each request is described by parameters which fully characterize its behavior. The proposed models, which are obtained by analyzing these parameters, are able to reproduce the behavior of the mail server workloads.
The report is organized as follows. We start with a description of the measurements and of the parameters considered in our studies. We then present the models characterizing the mail server workloads and we briefly describe how to use these models.
Much of the document discusses the workload changes between new SPECmail2009 and the original SPECmail2008 benchmark workload. Many of the internal distributions were updated with complete message and folder profiles provided by Apple, Inc in 2008. Most of this data replaces the original message and mailbox composition distributions. The SMTP traffic levels have been incorporated into the recipient and message size distributions.
One workload addition not discussed in this document is the ability to test using encrypted TCP connections. The reason lies in where this encryption incurs its cost. The e-mail clients issue commands according to user or programatic directives, regardless of the network connection's encryption mode. Empirical data shows both SUT and e-mail clients require extra computing and/or memory resources if encryption exists. Therefore, the benchmark's Secure metric influences the number of concurrent network sessions and interarrival times but not the actual command sequences. The two SPECmail2009 metrics show the effects of encrypted network connections on the SUT.
The measurements analyzed in our studies come from different sources. The measurements related to SMTP and IMAP4 have been provided by four companies and by two universities.? The collected sessions were divided into five IMAP4 and two SMTP groups.? The sessions within each group form the basis for all of the parameters that define the Enterprise User Profile, emulated by the SPECmail2009 benchmark.
IMAP Information Sources – |
||||
Data Source |
Total Number of Users |
Number of IMAP Users |
Data Source Type |
Network Type |
Mirapoint |
223 |
223 |
Small company |
LAN |
Openwave |
2500 |
500 |
Medium company |
WAN |
Sun |
147 |
147 |
Medium workgroup |
LAN |
Apple |
39,970 |
~30,000 |
Large corporation |
LAN/WAN |
|
Unknown |
|
|
LAN |
|
Unknown |
|
|
LAN |
SPECmail2009 (Enterprise Model) |
42,000+ (250 Minimum) |
32,000+ (250 Minimum) |
|
LAN/MAN |
SPECmail2008 (Enterprise Model) |
250 (Minimum) |
250 (Minimum) |
|
LAN/MAN (1% dialup) |
SPECmail2001 (Dialup ISP Model) |
10,000 |
10,000 |
Consumer |
Dialup |
The IMAP4 protocol allows email clients to create and maintain any number of folders and subfolders, in addition to the standard Inbox folder used in the SPECmail2001 POP3 user profile.? The IMAP4 command set also allows email clients to ask the server to describe these structures.? This information is independent of the delivery or retrieval protocols and so is treated outside of specific protocol and/or server context.
MIME is an internet attachment scheme, defined as a formal standard by RFCs 1521, 1522, and 1523.? The Sun and Apple data sets provided detailed information about mailbox and message structure.? Thus they form the basis for the following probability distribution tables used in the benchmark.?
The initial processing of all message sizes distinguished between single part sizes and multipart sizes.? The IMAP4 benchmark prioritizes individual MIME part size over the global message size distribution.
Single Part messages (Sun: 76% of total, Apple: 47% of total)
Multipart Message (Sun: 24% of total, Apple: 53% of total)
Below are the distributions used in constructing messages in compliant with the MIME standard.
MIME Part size (bytes) vs. Probabilities Distribution |
||||||||
Part Size |
Probability (Sun) |
Probability (Apple) |
Part Size |
Probability (Sun) |
Probability (Apple) |
Part Size |
Probability (Sun) |
Probability (Apple) |
0 |
N/A |
0.04% |
256 |
10.5% |
2.28% |
128 KB |
0.7% |
1.88% |
1 |
N/A |
< 0.001% |
512 |
15.6% |
6.37% |
256 KB |
0.4% |
1.21% |
2 |
0.6% |
< 0.01% |
1 KB |
13.6% |
9.22% |
512 KB |
0.3% |
0.68% |
4 |
0.1% |
< 0.01% |
2 KB |
13.9% |
18.00% |
1 MB |
0.2% |
0.45% |
8 |
0.4% |
< 0.01% |
4 KB |
13.4% |
28.97% |
2 MB |
0.1% |
0.27% |
16 |
0.8% |
< 0.01% |
8 KB |
8.5% |
11.37% |
4 MB |
N/A |
0.19% |
32 |
1.8% |
0.05% |
16 KB |
4.3% |
6.46% |
8 MB |
N/A |
0.10% |
64 |
4.1% |
0.31% |
32 KB |
2.3% |
3.91% |
16 MB |
N/A |
0.03% |
128 |
7.2% |
5.18% |
64 KB |
1.2% |
3.02% |
32 MB |
N/A |
0.01% |
|
|
|
|
|
|
64 MB |
N/A |
< 0.01% |
The following tables show the distribution of the number of MIME parts at the top level (without regard to nesting). It reflects the count of multipart/mixed parts immediately “attached” to the main message. It does not reflect any counting of multipart/alternative parts (i.e. text/plain and text/html, alternative formats of the same text). Nor does it reflect the MIME attachment depths (“attachments” to “attachments” or forwarded messages).
MIME Top-Level Part Counts Distribution |
||||||||
Part Count |
Probability (Sun) |
Probability (Apple) |
Part Count |
Probability (Sun) |
Probability (Apple) |
Part Count |
Probability (Sun) |
Probability (Apple) |
0 |
N/A |
46.69% |
3 |
1.99% |
2.51% |
6 |
N/A |
0.06% |
1 |
75.76% |
3.77% |
4 |
0.24% |
0.29% |
7 |
N/A |
0.07% |
2 |
21.91% |
46.20% |
5 |
0.09% |
0.26% |
8+ |
N/A |
0.15% |
The next tables show the distribution of the nested MIME Part Levels that occur within a given message from the sample of MIME parts. It generally reflects messages or attachments which are forwarded multiple times, each time adding another depth level to the resulting message.
Distribution of MIME Part Depths |
||||||||
Part Depth |
Probability (Sun) |
Probability (Apple) |
Part Depth |
Probability (Sun) |
Probability (Apple) |
Part Depth |
Probability (Sun) |
Probability (Apple) |
0 or 1 |
91.24% |
90.18% |
3 |
0.87% |
0.62% |
5 |
0.03% |
0.01% |
2 |
7.73% |
9.14% |
4 |
0.13% |
0.04% |
6+ |
N/A |
< 0.01% |
The following tables show the distribution of primary MIME Content Type (not including subtype) of all the parts in the entire sample.
MIME Content Type Distribution |
|||||
Content type |
Probability (Sun) |
Probability (Apple) |
Content type |
Probability (Sun) |
Probability (Apple) |
TEXT |
92.193% |
86.584% |
IMAGE |
0.888% |
5.943% |
APPLICATION |
4.265% |
6.971% |
AUDIO |
0.016% |
0.018% |
MESSAGE |
2.633% |
0.465% |
VIDEO |
0.004% |
0.019% |
After Sun's values were reviewed, a former employee noted that the Unix company that provided MIME distributions tended to use more text messages. Other companies have more and larger MIME parts that have richer, non-textual, content such as word processor documents, presentations, spreadsheets, web pages, calendar events, images, audio, and both rich and simple alternate MIME structures. The major effect of this shift is a tendency to increase the overall message sizes, and decreasing the Text content type in favor of the other categories.
However, increased Alternate structures does not eliminate the Text portion's counts. It just increases the other content types counters. Also, the IMAP server is not required to interpret the actual MIME parts content. It must extract the MIME part(s) and send the content, as is, to the IMAP4 client, which performs the interpretation. Therefore, the shift in Content Type distribution affects the benchmark's MIME structure of the message delivered to the SUT. The SUT still must deconstruct these MIME structures, but not the actual content.
The following tables show the distribution of messages in folders at the first five levels.
Level by Level Message Probability Distributions - Mirapoint, Openwave, Sun |
|||||||||
Top Level |
Level 1 |
Level 2 |
Level 3 |
Level 4 |
|||||
Width |
Probability |
Width |
Probability |
Width |
Probability |
Width |
Probability |
Width |
Probability |
0 |
16.4% |
0 |
8.1% |
0 |
6.1% |
0 |
6.8% |
0 |
1.0% |
1 |
21.5% |
1 |
31.9% |
1 |
48.1% |
1 |
49.5% |
1 |
81.4% |
2 |
3.4% |
2 |
4.6% |
2 |
3.2% |
2 |
3.2% |
2 |
1.0% |
3 |
2.8% |
3 |
2.9% |
3 |
2.1% |
3 |
3.2% |
3 |
1.0% |
4 |
2.1% |
4 |
2.4% |
4 |
2.7% |
4 |
2.2% |
5 |
1.0% |
5 |
2.1% |
5 |
2.0% |
5 |
1.5% |
5 |
2.0% |
6 |
2.9% |
6 |
1.7% |
6 |
1.7% |
6 |
2.3% |
6 |
1.8% |
20 |
4.9% |
7 |
1.2% |
7 |
1.6% |
7 |
1.6% |
7 |
1.8% |
30 |
2.0% |
8 |
1.5% |
8 |
1.1% |
8 |
1.5% |
9 |
2.0% |
40 |
1.0% |
9 |
1.5% |
9 |
1.1% |
9 |
1.2% |
10 |
1.1% |
80 |
2.0% |
20 |
7.3% |
10 |
1.3% |
20 |
7.8% |
20 |
10.3% |
200 |
2.0% |
30 |
5.2% |
20 |
7.8% |
30 |
3.8% |
30 |
4.1% |
|
|
40 |
3.0% |
30 |
5.3% |
40 |
3.1% |
40 |
3.1% |
|
|
50 |
2.0% |
40 |
3.8% |
50 |
2.1% |
50 |
1.3% |
|
|
60 |
1.9% |
50 |
2.6% |
60 |
1.2% |
70 |
1.8% |
|
|
70 |
1.4% |
60 |
1.8% |
80 |
1.6% |
100 |
1.3% |
|
|
80 |
1.3% |
70 |
1.6% |
100 |
1.3% |
200 |
2.2% |
|
|
90 |
1.0% |
80 |
1.3% |
200 |
2.5% |
600 |
1.4% |
|
|
200 |
5.6% |
90 |
1.1% |
300 |
1.2% |
3000 |
1.1% |
|
|
300 |
3.0% |
200 |
5.9% |
500 |
1.2% |
|
|
|
|
400 |
1.3% |
300 |
1.9% |
800 |
1.0% |
|
|
|
|
500 |
1.0% |
400 |
1.5% |
2000 |
3.0% |
|
|
|
|
600 |
1.1% |
500 |
1.2% |
|
|
|
|
|
|
1000 |
2.2% |
700 |
1.6% |
|
|
|
|
|
|
2000 |
3.1% |
1000 |
1.2% |
|
|
|
|
|
|
3000 |
1.5% |
2000 |
1.5% |
|
|
|
|
|
|
4000 |
2.3% |
5000 |
1.2% |
|
|
|
|
|
|
Level by Level Message Probability Distributions - Apple |
|||||||||
Top Level |
Level 1 |
Level 2 |
Level 3 |
Level 4 |
|||||
Width |
Probability |
Width |
Probability |
Width |
Probability |
Width |
Probability |
Width |
Probability |
0 |
0.84% |
0 |
32.83% |
0 |
15.35% |
0 |
10.21% |
0 |
9.45% |
1 |
2.10% |
1 |
6.79% |
1 |
8.21% |
1 |
11.06% |
1 |
9.64% |
2 |
0.66% |
2 |
3.96% |
2 |
5.70% |
2 |
6.40% |
2 |
7.93% |
3 |
0.47% |
3 |
2.94% |
3 |
4.31% |
3 |
4.83% |
3 |
6.06% |
4 |
0.80% |
4 |
2.31% |
4 |
3.52% |
4 |
4.05% |
5 |
9.74% |
5 |
0.87% |
5 |
2.03% |
5 |
2.97% |
5 |
3.54% |
6 |
4.02% |
6 |
0.77% |
6 |
1.74% |
6 |
2.56% |
6 |
2.94% |
20 |
25.41% |
7 |
0.95% |
7 |
1.50% |
7 |
2.22% |
7 |
2.76% |
30 |
6.47% |
8 |
0.75% |
8 |
1.35% |
8 |
2.01% |
9 |
4.61% |
40 |
4.52% |
9 |
0.6% |
9 |
1.26% |
9 |
1.85% |
10 |
1.97% |
80 |
6.90% |
20 |
6.07% |
10 |
1.16% |
20 |
12.57% |
20 |
12.82% |
200 |
9.88% |
30 |
4.10% |
20 |
7.82% |
30 |
6.28% |
30 |
6.94% |
|
|
40 |
3.75% |
30 |
4.57% |
40 |
4.07% |
40 |
4.22% |
|
|
50 |
3.01% |
40 |
3.13% |
50 |
2.97% |
50 |
2.96% |
|
|
60 |
2.83% |
50 |
2.40% |
60 |
2.26% |
70 |
3.97% |
|
|
70 |
2.62% |
60 |
1.84% |
80 |
3.44% |
100 |
3.39% |
|
|
80 |
2.08% |
70 |
1.48% |
100 |
2.37% |
200 |
5.25% |
|
|
90 |
2.14% |
80 |
1.29% |
200 |
6.10% |
600 |
7.04% |
|
|
200 |
14.91% |
90 |
1.09% |
300 |
|
3000 |
1.07% |
|
|
300 |
|
200 |
6.54% |
500 |
5.11% |
|
|
|
|
400 |
|
300 |
|
800 |
2.55% |
|
|
|
|
500 |
17.52% |
400 |
|
2000 |
3.58% |
|
|
|
|
600 |
|
500 |
5.18% |
|
|
|
|
|
|
1000 |
11.03% |
700 |
|
|
|
|
|
|
|
2000 |
8.22% |
1000 |
2.77% |
|
|
|
|
|
|
3000 |
|
2000 |
1.92% |
|
|
|
|
|
|
4000 |
12.91% |
5000 |
2.09% |
|
|
|
|
|
|
Here is the same data from Apple bucketed such that each contains roughly five percentage points. These are the actual values used in the benchmark.
Level by Level Message Probability Distributions - Apple |
|||||||||
Top Level |
Level 1 |
Level 2 |
Level 3 |
Level 4 |
|||||
Width |
Probability |
Width |
Probability |
Width |
Probability |
Width |
Probability |
Width |
Probability |
0 |
0.84% |
0 |
32.83% |
0 |
15.35% |
0 |
10.21% |
0 |
9.45% |
5 |
4.90% |
1 |
6.79% |
1 |
8.21% |
1 |
11.06% |
1 |
9.64% |
12 |
5.00% |
3 |
6.89% |
2 |
5.70% |
2 |
6.40% |
2 |
7.93% |
22 |
5.07% |
6 |
6.08% |
4 |
7.83% |
4 |
8.88% |
3 |
6.06% |
35 |
5.19% |
10 |
5.27% |
6 |
5.53% |
6 |
6.48% |
4 |
5.14% |
51 |
5.01% |
16 |
5.28% |
9 |
6.08% |
8 |
5.31% |
6 |
8.62% |
70 |
5.15% |
25 |
5.03% |
13 |
5.73% |
11 |
5.71% |
8 |
6.25% |
95 |
5.16% |
40 |
5.21% |
18 |
5.19% |
15 |
5.86% |
11 |
6.61% |
127 |
5.10% |
65 |
5.01% |
25 |
5.09% |
20 |
5.28% |
14 |
5.47% |
165 |
5.09% |
111 |
5.00% |
35 |
5.07% |
27 |
5.15% |
19 |
5.95% |
212 |
5.01% |
212 |
5.01% |
51 |
5.06% |
38 |
5.27% |
26 |
5.51% |
274 |
5.02% |
524 |
5.00% |
77 |
5.06% |
56 |
5.15% |
36 |
5.12% |
356 |
5.05% |
2577 |
5.00% |
126 |
5.05% |
91 |
5.06% |
55 |
5.11% |
466 |
5.03% |
3000+ |
1.60% |
239 |
5.00% |
169 |
5.01% |
104 |
5.05% |
623 |
5.02% |
|
|
654 |
5.00% |
462 |
5.00% |
359 |
5.01% |
855 |
5.01% |
|
|
2000+ |
5.05% |
1000+ |
4.17% |
500+ |
3.08% |
1232 |
5.01% |
|
|
|
|
|
|
|
|
1922 |
5.00% |
|
|
|
|
|
|
|
|
3275 |
5.01% |
|
|
|
|
|
|
|
|
4000+ |
8.33% |
|
|
|
|
|
|
|
|
A mail server that supports IMAP is likely to support a hierarchy of several mailboxes (also known folders) in addition to the default INBOX mailbox for each user.? Below are several distributions to construct the structure of mailboxes contained within a mailstore supported by IMAP.? The data used is extracted from the four enterprise data samples (Mirapoint, Openwave, Sun, Apple).
The following tables show the probably of an individual user having a certain number of mailboxes (aka folders) at each level (depth).? The data reflects the probability distributions for the first five (5) levels, even though the actual samples went many levels deeper than that.?
Level by Level Subfolder Probability Distributions - Mirapoint, Openwave, Sun |
|||||||||
Top to Level 1 |
Level 1 to 2 |
Level 2 to 3 |
Level 3 to 4 |
Level 4 to 5 |
|||||
Width |
Probability |
Width |
Probability |
Width |
Probability |
Width |
Probability |
Width |
Probability |
1 |
34.9% |
1 |
31.4% |
1 |
43.0% |
1 |
39.6% |
1 |
36.8% |
2 |
21.7% |
2 |
12.4% |
2 |
14.9% |
2 |
12.6% |
2 |
7.9% |
3 |
11.6% |
3 |
7.4% |
3 |
9.1% |
3 |
8.1% |
3 |
39.5% |
4 |
7.0% |
4 |
5.6% |
4 |
6.8% |
4 |
10.8% |
4 |
5.3% |
5 |
2.0% |
5 |
4.0% |
5 |
3.5% |
5 |
2.7% |
6 |
2.6% |
6 |
2.4% |
6 |
2.4% |
6 |
4.1% |
6 |
7.2% |
7 |
2.6% |
7 |
1.5% |
7 |
5.0% |
7 |
2.0% |
7 |
2.7% |
8 |
5.3% |
8 |
0.7% |
9 |
5.8% |
8 |
2.0% |
8 |
0.9% |
|
|
9 |
0.7% |
10 |
2.6% |
9 |
3.3% |
9 |
1.8% |
|
|
10 |
0.7% |
15 |
7.4% |
10 |
1.0% |
14 |
3.6% |
|
|
20 |
8.1% |
20 |
3.2% |
20 |
5.8% |
15 |
0.9% |
|
|
30 |
3.7% |
30 |
7.2% |
30 |
3.0% |
20 |
3.6% |
|
|
40 |
1.8% |
70 |
3.4% |
40 |
0.5% |
25 |
2.7% |
|
|
50 |
2.0% |
200 |
1.8% |
50 |
0.5% |
30 |
1.8% |
|
|
103 |
1.3% |
246 |
0.4% |
61 |
0.3% |
42 |
0.9% |
|
|
Level by Level Subfolder Probability Distributions - Apple |
|||||||||
Top to Level 1 |
Level 1 to 2 |
Level 2 to 3 |
Level 3 to 4 |
Level 4 to 5 |
|||||
Width |
Probability |
Width |
Probability |
Width |
Probability |
Width |
Probability |
Width |
Probability |
1 |
0.38% |
1 |
37.28% |
1 |
38.86% |
1 |
41.69% |
1 |
37.52% |
2 |
0.71% |
2 |
14.13% |
2 |
17.28% |
2 |
17.26% |
2 |
23.47% |
3 |
41.11% |
3 |
12.26% |
3 |
10.23% |
3 |
10.82% |
3 |
13.88% |
4 |
17.15% |
4 |
6.60% |
4 |
7.07% |
4 |
6.71% |
4 |
6.94% |
5 |
8.48% |
5 |
5.41% |
5 |
5.13% |
5 |
5.30% |
5 |
3.47% |
6 |
5.59% |
6 |
4.09% |
6 |
3.69% |
6 |
3.56% |
6 |
3.64% |
7 |
4.01% |
7 |
3.14% |
7 |
3.06% |
7 |
2.51% |
7 |
1.98% |
8 |
3.24% |
8 |
2.57% |
8 |
2.18% |
8 |
1.78% |
8 |
1.16% |
9 |
2.66% |
9 |
2.08% |
9 |
1.97% |
9 |
1.74% |
9 |
0.99% |
10 |
2.04% |
10 |
1.66% |
10 |
1.77% |
10 |
1.10% |
10 |
1.16% |
15 |
6.57% |
15 |
4.97% |
15 |
4.07% |
15 |
3.79% |
15 |
2.64% |
20 |
3.28% |
20 |
2.40% |
20 |
1.94% |
20 |
2.15% |
20 |
1.32% |
25 |
1.77% |
25 |
1.17% |
25 |
0.77% |
25 |
0.64% |
25 |
1.32% |
50 |
2.49% |
50 |
1.66% |
50 |
1.66% |
50 |
0.82% |
50 |
0.33% |
100 |
0.40% |
100 |
0.38% |
100 |
0.27% |
100 |
0.05% |
|
|
500 |
0.10% |
500 |
0.17% |
500 |
0.06% |
500 |
0.09% |
|
|
501+ |
0.01% |
501+ |
0.02% |
|
|
|
|
501+ |
0.17% |
The following tables show the percent of folders at each level containing any subfolders.
Level by Level Folders With Any Subfolders - Mirapoint, Openwave, Sun |
|||||||||
Top to Level 1 |
Level 1 to 2 |
Level 2 to 3 |
Level 3 to 4 |
Level 4 to 5 |
|||||
Width |
Probability |
Width |
Probability |
Width |
Probability |
Width |
Probability |
Width |
Probability |
0 |
59.0% |
0 |
64.0% |
0 |
80.0% |
0 |
78.4% |
0 |
97.4% |
1 |
21.9% |
1 |
20.6% |
1 |
15.7% |
1 |
14.4% |
1 |
2.6% |
2 |
7.5% |
2 |
9.0% |
2 |
2.3% |
2 |
3.6% |
|
|
3 |
3.3% |
3 |
1.4% |
3 |
0.8% |
3 |
1.8% |
|
|
4 |
2.0% |
4 |
1.2% |
4 |
1.0% |
4 |
1.8% |
|
|
5 |
0.4% |
5 |
0.8% |
6 |
0.2% |
|
|
|
|
6 |
1.3% |
6 |
1.0% |
|
|
|
|
|
|
7 |
2.0% |
7 |
0.6% |
|
|
|
|
|
|
8 |
0.4% |
8 |
0.2% |
|
|
|
|
|
|
9 |
0.7% |
9 |
0.2% |
|
|
|
|
|
|
10 |
0.7% |
10 |
0.4% |
|
|
|
|
|
|
11 |
0.4% |
15 |
0.4% |
|
|
|
|
|
|
21 |
0.25% |
19 |
0.2% |
|
|
|
|
|
|
26 |
0.15% |
|
|
|
|
|
|
|
|
Level by Level Folders With Any Subfolders - Apple |
|||||||||
Top to Level 1 |
Level 1 to 2 |
Level 2 to 3 |
Level 3 to 4 |
Level 4 to 5 |
|||||
Width |
Probability |
Width |
Probability |
Width |
Probability |
Width |
Probability |
Width |
Probability |
0 |
94.58% |
0 |
90.62% |
0 |
92.14% |
0 |
92.81% |
0 |
95.25% |
1 |
2.02% |
1 |
3.65% |
1 |
3.28% |
1 |
2.70% |
1 |
2.04% |
2 |
0.77% |
2 |
1.62% |
2 |
1.36% |
2 |
1.69% |
2 |
1.10% |
3 |
0.67% |
3 |
0.96% |
3 |
0.85% |
3 |
1.00% |
5 |
0.98% |
4 |
0.36% |
4 |
0.66% |
4 |
0.53% |
5 |
0.75% |
10 |
0.63% |
5 |
0.29% |
5 |
0.48% |
5 |
0.42% |
10 |
1.05% |
|
|
6 |
0.22% |
6 |
0.35% |
6 |
0.28% |
|
|
|
|
7 |
0.17% |
7 |
0.29% |
7 |
0.20% |
|
|
|
|
8 |
0.14% |
8 |
0.20% |
8 |
0.14% |
|
|
|
|
9 |
0.11% |
9 |
0.18% |
9 |
0.14% |
|
|
|
|
10 |
0.09% |
10 |
0.17% |
10 |
0.66% |
|
|
|
|
20 |
0.58% |
15 |
0.82% |
|
|
|
|
|
|
Below is a walk through of the construction of a folder tree with a diagram to illustrate the use of the above distribution tables in creating a folder tree for user “U1”. The probability values used are only examples, not actual distribution table entries.
Folder Level Construction for User “U1” Example |
|||
Level |
Next Level |
Probability Computation |
Diagram Representation |
0 |
1 |
10.1% probability of 10 sub-folders |
Create folders A1 through
A10. |
|
|
7.2% probability of 2
folders having sub-folders |
Mark folders A5 and A10 red
to indicate presence of Level 2 sub-folders |
1 |
2 |
6.3% probability of 7
sub-folders under A5 |
Create folders B1 through
B7 under A5 |
|
|
23.5% probability of 1
sub-folder under A10 |
Folder B1 under A10 |
2 |
3 |
5.4% probability of 1
folder under A5 having any sub-folders |
Mark folder A5.B5 red to
indicate presence of Level 3 sub-folders |
|
|
32.4% probability of 0
folders under A10 having any sub-folders |
No subfolders under A10.B1 |
|
|
35.8% probability of 1
level-3 sub-folder under A5.B5 |
Create folder C1 under
A5.B5 |
3 |
4 |
56.8% probability of 0 folders
under A5.B5 having any sub-folders |
No subfolders under
A5.B5.C1 |
The diagram below shows the mailbox structure for user U1.
Figure 1: Mailbox Structure Diagram
The overall peak traffic hour must be based on both SMTP and the corresponding IMAP activity over the same period of time.? Therefore, only two data samples were used to determine the relative workloads – Mirapoint and Openwave.? The other data samples did not provide corresponding SMTP logs for this purpose.
The following table shows the overall traffic volumes and users over the course of the peak day (determined by total number of message activity from the data samples).
Peak Mail Server
Traffic – |
||||||||
Data |
Mirapoint Samples |
Openwave Samples |
||||||
Sample Hour |
SMTP |
IMAP |
Combined |
Unique Sender/Rcpt |
SMTP |
IMAP |
Combined |
Unique Sender/Rcpt |
0 |
503 |
57 |
560 |
|
1169 |
5166 |
6335 |
|
1 |
571 |
60 |
631 |
|
1289 |
5435 |
6724 |
|
2 |
519 |
60 |
579 |
|
1033 |
4319 |
5352 |
|
3 |
456 |
60 |
516 |
|
1114 |
4210 |
5324 |
|
4 |
479 |
60 |
539 |
|
1158 |
4054 |
5212 |
|
5 |
503 |
63 |
566 |
|
1076 |
3777 |
4853 |
|
6 |
550 |
60 |
610 |
|
1108 |
3503 |
4611 |
|
7 |
869 |
103 |
972 |
|
1042 |
4566 |
5608 |
|
8 |
942 |
606 |
1548 |
|
1449 |
7383 |
8832 |
|
9 |
1198 |
1075 |
2273 |
|
2174 |
7315 |
9489 |
|
10 |
1029 |
2278 |
3307 |
90/160 |
2082 |
7247 |
9329 |
|
11 |
987 |
23015 |
24002 |
|
2217 |
6331 |
8548 |
|
12 |
874 |
2052 |
2926 |
|
2079 |
6186 |
8265 |
|
13 |
978 |
1507 |
2485 |
|
2120 |
7784 |
9904 |
|
14 |
1560 |
1235 |
2795 |
|
2818 |
8246 |
11064 |
|
15 |
1485 |
1119 |
2604 |
|
3809 |
10196 |
14005 |
|
16 |
841 |
783 |
1624 |
|
4846 |
10620 |
15466 |
1836/1836 |
17 |
803 |
541 |
1344 |
|
5665 |
9306 |
14971 |
|
18 |
502 |
466 |
968 |
|
5513 |
8504 |
14017 |
|
19 |
360 |
412 |
772 |
|
5125 |
6462 |
11587 |
|
20 |
316 |
249 |
565 |
|
4177 |
6260 |
10437 |
|
21 |
476 |
215 |
691 |
|
4440 |
6067 |
10507 |
|
22 |
377 |
218 |
595 |
|
4271 |
6133 |
10404 |
|
23 |
340 |
229 |
569 |
|
4004 |
6178 |
10182 |
|
Daily |
17,518 |
36,523 |
30,039 |
238/168 |
65,778 |
155,248 |
155,248 |
2,254/3,000 |
The Peak Hour Percent of Active Users is computed by first using the larger of the two unique Sending or Recipient users and dividing that value by the total number of provisioned users.? The percentages are then pro-rated based on the relative number of actual users to compute the actual Percent of Active Users used in the benchmark.
Peak Hour Percent Provisioned Users |
|||
Company (Source) |
Data Type |
Number of Users |
Percent PH/Prov |
Mirapoint |
Peak Hour |
160 |
|
|
Provisioned |
269 |
59% |
Openwave |
Peak Hour |
1,836 |
|
|
Provisioned |
2,299 |
80% |
Normalized PH Percent Active Users |
78% |
PEAK_PCT_USERS =
"78"
The Peak Hour Activity Percentage can be derived by using the traffic volume from the peak hour and the daily total for each protocol.? Again the benchmark value is computed by pro-rating each data sample within the overall user counts.
Peak Hour Percent Of Daily Traffic by Protocol |
|||
Company |
SMTP |
IMAP |
Combined |
Mirapoint |
6% |
6% |
11% |
Openwave |
7% |
7% |
10% |
Normalized PH Percent of Daily Traffic |
10% |
The SMTP log files reflect mail transfer agent workloads from the four enterprises, three collected over the course of fourteen (14) to thirty (30) days of operation and one from one day. The workload refers to all the requests processed by the mail server for delivering incoming and outgoing messages. These enterprises ranged from approximately 120 to 40,000 users.? The data logs cover the full 24-hour day, over the course of the data collection period.
The parameters used to describe the requests processed by the mail server are:
The table below shows the statistics for SMTP traffic flows and message sizes for the four enterprises.?? The ISP user profile statistics are included to illustrate the difference with the original user model.
SMTP In/Out-bound Traffic – |
||||
Data Source |
Percent Inbound Traffic |
Percent Outbound Traffic |
Average Message Size (KB) |
Data Source Type |
Mirapoint |
85% |
15% |
24 |
Small company |
Openwave |
92% |
8% |
44 |
Medium company |
Sun |
98% |
2% |
23 |
Medium workgroup |
Apple |
Unknown |
Unknown |
105 |
Large corporation |
SPECmail2009 |
93% |
7% |
101 |
Pro-rated medium/large company |
SPECmail2008 |
93% |
7% |
38 |
Pro-rated small/medium company |
SPECmail2001 (Dialup ISP Model) |
53% |
47% |
25 |
Consumer Dialup |
The following two tables contain the profile of the number of recipients per
message, based on the Mirapoint and Openwave SMTP data from the busiest day of
the week. The Apple SMTP sample did not distinguish between remote and local
recipients, nor the actual recipient count from the RCPT TO step. So this
version reuses the original recipient information. This data was extracted from
both recipients named in the RCPT TO lines, as well as, recipient counts based
on the mailing list expansions. The benchmark uses the probably distributions
in the second table to generate the actual SMTP traffic.
Peak Hour
SMTP Message Rate Comparison |
||||
Company (Source) |
Data Type |
Peak Hour Total Mesg/User |
Daily Total Mesg/User |
Peak Hour Mesg per Unique
User |
Mirapoint |
Sender |
11.4 |
73.6 |
8.7 |
|
Recipient |
6.4 |
65.1 |
6.7 |
Openwave |
Sender |
2.6 |
29.2 |
0.5 |
|
Recipient |
2.6 |
21.9 |
4.9 |
Normalized PH Messages Per User |
5 |
Peak Hour
From/To Analysis |
||||
Company (Source) |
Data Type |
From Local to Remote |
From Local to Local |
From Remote to Local |
Mirapoint |
Count |
84 |
431 |
262 |
|
% of Total |
11% |
55% |
34% |
Openwave |
Count |
195 |
789 |
429 |
|
% of Total |
14% |
56% |
30% |
Normalized PH SMTP Message Flow |
13% |
56% |
31% |
SMTP Recipients per Message – |
|||
Data Source |
Minimum |
Average |
Maximum |
Mirapoint |
1 |
2.0 |
133 |
Openwave |
1 |
3.3 |
74 |
Sun Microsystems |
n/a |
n/a |
n/a |
Apple |
1 |
3.9 |
2061 |
SPECmail2009 Benchmark |
1 |
3.8 |
2061 |
SPECmail2008 Benchmark |
1 |
3.1 |
133 |
SPECmail2001 Benchmark |
1 |
2 |
20 |
SMTP Recipients per Message Distribution |
|||||
Recipients |
Probability |
Probability |