Mailbox Creation Distribution ***************************** By default, when creating a mailbox in Cyrus IMAP: * the backend with the most free disk space is selected on the Murder frontend * the partition with the most free disk space is selected on the backend This may not be the most appropriate backend or partition to create the new mailbox on, and Cyrus IMAP therefor allows for a variety of additional modes of calculating and selecting the most appropriate backend and partition. The exact mode for the selection is controlled with the "imapd.conf" setting "serverlist_select_mode" on the frontend and "partition_select_mode" on the backend. Alternatively, a default backend can be configured with the "defaultserver" setting on a frontend, and a default partition can be configured with the "defaultpartition" on a backend. Prior to Cyrus IMAP version 2.5, when creating a mailbox, should no target partition have been specified, the mailbox is either created on: * the configured "defaultserver", or * the server that is found to have the most free disk space, and * the configured "defaultpartition", or * the most fitting partition if "defaultpartition" is not configured. New configuration options are available since Cyrus IMAP 2.5, that allow more weighted and better balanced backend and partition selection. Selection Mode ============== Among the partitions, how the most fitting one is selected depends on the configured selection mode: "partition_mode" setting in "/etc/imapd.conf" Available Selection Modes ------------------------- * random Choice is (pseudo-)random. Each partition has the same probability of being selected. * freespace-most The partition which has the most free space (counted in KiB units) is selected. Note: Example of "freespace-most" selection on backendSuppose that the configured partitions are: * "part1" which has a 1000GiB capacity, 400GiB being free (that is, 40% free space) * "part2" which has a 1000GiB capacity, 600GiB being free (that is, 60% free space) * "part3" which has a 100GiB capacity, 30GiB being free (that is, 30% free space) * "part4" which has a 100GiB capacity, 70GiB being free (that is, 70% free space) In that case "part2" will be selected as most fitting, since 600GiB of free space is the biggest of all partitions. * freespace-percent-most The partition which has the most percentage of free space is selected. Note: Example of "freespace-percent-most" selection on backendIn the same conditions, "part4" would be selected as most fitting, since 70% of free space is the biggest of all partitions. * freespace-percent-weighted For each partition, the percentage of free space is its weight. Then a weighted choice is performed to select one of those partitions. As such, the more free space the partition has, the more chances it has to be selected. Note: Example of "freespace-percent-weighted" selection on backendIn the same conditions, the weight of each partition would be: * 40 for "part1" * 60 for "part2" * 30 for "part3" * 70 for "part4" The sum of each weight being 200, the probability for each partition to be selected as most fitting would be: * 20% for "part1" * 30% for "part2" * 15% for "part3" * 35% for "part4" Usage convergence ----------------- In "freespace-percent-weighted" mode, partitions percentage usages converge towards 100%. So if they have different usages, those differences will stay and only really diminish upon reaching 100% of usage. You may also observe growing differences between partitions usage when those partitions do not have the same total disk space. * freespace-percent-weighted-delta As for "freespace-percent-weighted", a weight is associated to each partition. It is computed as follows: (*percentage of freespace of partition*) - (*lowest percentage of freespace of all partitions*) + 0.5 Then a weighted choice is performed to select one of those partitions. As such, considering the percentages of usage, the more the partition is lagging behind the most used partition (which is the one with the lowest percentage of free space), the more chances it has to be selected. Computed weight --------------- The added 0.5 in partitions weight is so that selection gets smoother when all partitions get close to each other. Note: **Example of ``freespace-percent-weighted-delta`` Selection on Backend**In the same conditions, the weight of each partition would be: * 40 - 30 + 0.5 = 10.5 for "part1" * 60 - 30 + 0.5 = 30.5 for "part2" * 30 - 30 + 0.5 = 0.5 for "part3" * 70 - 30 + 0.5 = 40.5 for "part4" Then the probability for each partition to be selected as most fitting would be: * 12.8% for "part1" * 37.2% for "part2" * 0.6% for "part3" * 49.4% for "part4" **Usage convergence**In "freespace-percent-weighted-delta" mode, partitions percentage usages converge towards the most used one. And then partitions usages grow equally. Special cases ============= What happens when two partitions are equal as most fitting? ----------------------------------------------------------- Suppose you are using the "freespace-most" selection mode, that two (or more) partitions have the same free size, and that this freespace happens to be the biggest one of all configured partitions. In that case, only one of those partitions will be selected. You may not know in advance which one will be: it depends of the order in which configured partitions are stored in memory ("hashtable"). In particular, it may not be the first one that appears listed in your "/etc/imapd.conf" configuration file. Also note that since the selected partition will now have less free space, it shall not be seen as most fitting next time. What happens when two partitions point to the same device? ---------------------------------------------------------- Suppose you are using the "freespace-most" or "freespace-percent-most" selection mode, and that two (or more) partitions actually point to the same device (that is the device id is the same). In that case, only one of those partitions will be checked, as if the others were not configured. You may not know in advance which one will be: it depends of the order in which configured partitions are stored in memory ("hashtable"). In particular, it may not be the first one that appears listed in your "/etc/imapd.conf" configuration file. Partitions Exclusion ==================== Partitions listed in the "partition_mode_exclude" setting are permanently excluded from being selected. * partition_mode_exclude Listed partitions names are separated by space or comma. Only configured partition names (and not paths) are expected. Note: Example of permanent partition exclusionWith the following configuration in */etc/imapd.conf*, "parta" and "partb" would be permanently excluded from being selected:partition-part1: /path/to/part1 partition-part2: /path/to/part2 partition-part3: /path/to/part3 partition-part4: /path/to/part4 partition-parta: /path/to/parta partition-partb: /path/to/partbpartition_mode_exclude: parta partbWhen using a selection mode other than "random", partitions are automatically excluded if their usage percentage is beyond the "partition_mode_soft_usage_limit" integer setting. * partition_mode_soft_usage_limit If all partitions are beyond the configured value, this feature is automatically deactivated. A partition is thus selected as if the setting was not set. Note: Example of partition exclusion using "partition_mode_soft_usage_limit"In the same conditions than , setting "partition_mode_soft_usage_limit" to "50" would exclude partitions "part1" and "part3" since their disk usage is respectively 60% and 70%.But setting the option to "20" would have no effect, since the usage of all partitions is beyond 20%. Warning: **Exclusion is not absolute**Partitions are only excluded when creating a new user mailbox according to the configured selection mode.If you explicitly specify an excluded partition, the mailbox will be created on that partition.Mailboxes previously created on such partitions are of course still accessible, and subfolders are by default still created on the same partition as the parent folder. Partitions Usage Data Reset =========================== By default partitions usage data are retrieved only once upon service initialization. This only concerns selection modes other than "random". If you tend to use the same service instance for a long lapse of time and performs a large amount of mailboxes creation, it may be useful to configure the "partition_mode_usage_reinit" so that partitions usage data are refreshed after the configured number of creation requests. Mailbox Creation Distribution Through "murder frontend" ======================================================= Upon creating a user mailbox, if the target server is not given as extra parameter, the mailbox is either created on * the configured "defaultserver" backend * the most fitting backend among the servers listed in the "serverlist" setting, if "defaultserver" is not configured Selection Mode -------------- Among the backends, how the most fitting one is selected depends on the configured selection mode: "serverlist_mode" setting in "/etc/imapd.conf". The principle is similar to the mailbox creation distribution on backend (see ). Available Selection Modes on Frontend ------------------------------------- * random Choice is (pseudo-)random. Each backend has the same probability of being selected. * freespace-most The backend which has the most free space (counted in KiB units) is selected. The considered free space is the sum of all available partitions free space on the backend. Note: Example of "freespace-most" Selection on FrontendSuppose that the configured backends are: * "backend1" which has a 2000GiB capacity, 1000GiB being free (that is, 50% free space), composed of 2 partitions * "part1" which has a 1000GiB capacity, 500GiB being free (that is, 50% free space) * "part2" which has a 1000GiB capacity, 500GiB being free (that is, 50% free space) * "backend2" which has a 2000GiB capacity, 900GiB being free (that is, 45% free space), composed of 2 partitions * "part1" which has a 1000GiB capacity, 200GiB being free (that is, 20% free space) * "part2" which has a 1000GiB capacity, 700GiB being free (that is, 70% free space) * "backend3" which has a 200GiB capacity, 110GiB being free (that is, 55% free space), composed of 2 partitions * "part1" which has a 100GiB capacity, 30GiB being free (that is, 30% free space) * "part2" which has a 100GiB capacity, 80GiB being free (that is, 80% free space) * In that case "backend1" will be selected as most fitting, since 1000GiB of free space is the biggest of all backends. * freespace-percent-most On each backend, the partition with the most percentage of free space is considered. The selected backend is the one whose partition has the most percentage of free space. Note: Example of "freespace-percent-most" Selection on FrontendIn the same conditions than "backend3" would be selected as most fitting, since it has a partition with 80% of free space which is the biggest of all backends. * freespace-percent-weighted On each backend, the partition with the most percentage of free space is considered: it is the backend weight. Then a weighted choice is performed to select one of the backends. Note: Example of "freespace-percent-weighted" Selection on FrontendIn the same conditions than , the weight of each backend would be: * 50 for "backend1" * 70 for "backend2" * 80 for "backend3" The sum of each weight being 200, the probability for each backend to be selected as most fitting would be: * 25% for "backend1" * 35% for "backend2" * 40% for "backend3" * freespace-percent-weighted-delta On each backend, the partition with the most percentage of free space is considered. As for "freespace-percent-weighted", a weight is associated to each backend. It is computed as follows: (*percentage of freespace on backend*) - (*lowest percentage of freespace of all backends*) + 0.5 Then a weighted choice is performed to select one of the backends. Note: **Example of ``freespace-percent-weighted-delta`` Selection on Frontend**In the same conditions than , the weight of each backend would be: * 50 - 50 + 0.5 = 0.5 for "backend1" * 70 - 50 + 0.5 = 20.5 for "backend2" * 80 - 50 + 0.5 = 30.5 for "backend3" Then the probability for each backend to be selected as most fitting would be: * 1.0% for "backend1" * 39.8% for "backend2" * 59.2% for "backend3" Backends Exclusion ================== When using a selection mode other than "random", backends are automatically excluded if their considered usage percentage is beyond the "serverlist_mode_soft_usage_limit" integer setting. * "partition_mode_soft_usage_limit" If all backends are beyond the configured value, this feature is automatically deactivated. A backend is thus selected as if the setting was not set. Note: Example of partition exclusion using "serverlist_mode_soft_usage_limit"In the same conditions than , using "freespace-most" selection mode, setting "serverlist_mode_soft_usage_limit" to "49" would exclude "backend1" and "backend2" since in that mode they have a disk usage of 50% and 55%. In other modes it would however only exclude "backend1" whose considered partition has a disk usage of 50%, while on "backend2" the considered partition has a disk usage of 30%. Warning: **Exclusion is not absolute**Backends are only excluded when creating a new user mailbox according to the configured selection mode.If you explicitly specify an excluded backend, the mailbox will be created on that backend.Mailboxes previously created on such backends are of course still accessible. Backends Usage Data Reset ========================= By default backends usage data are retrieved only once upon service initialization. This only concerns selection modes other than "random". If you tend to use the same service instance for a long lapse of time and performs a large amount of mailboxes creation, it may be useful to configure the "serverlist_mode_usage_reinit" so that backends usage data are refreshed after the configured number of creation requests.