Documentation
13.6. File Dispatcher Event Handler¶
13.6.1. Introduction¶
The file dispatcher is a generic event handler which is used to perform various actions on the files on which SFTPPlus operates.
As any event handler you can define all the available filtering rules for defining the scope on which the handler should be triggered.
It can be used for both server-side and client-side operations.
A common server-side use case is to move the file outside of a single inbox used by a remote account to push the files, and forward the files to a different internal team which needs a copy of the pushed file.
For the client-side, a common use case is to move or copy the pulled files to various local folders, once the file was copied from the remote server.
In this way there is a separation of concerns among various components which take part in a managed file transfer process.
One component will make sure the files are received and handled using various file transfer protocols like SFTP or FTPS. Once the files are received via the protocols, a separate component can be configured to further process the files, outside of the realm of FTPS or HTTPS file transfer protocols.
13.6.2. Operation Principle¶
The file-dispatcher event handler, when configured to be triggered for an event will use the path attribute associated with the event and perform an action on that path.
Not all events can be used with this event handler. Only events which have have an associated path on the local filesystem can be used.
For example, the events from the group file-operation can be used together with the file-dispatcher event handler, since those events have an event attribute pointing to path on the local filesystem.
Using the dispatch_attribute configuration you can define which attribute of the event to use for triggering the actions.
When used for server-side operations,
the file transfer actions have both a real and virtual path.
A remote file transfer client authenticated as username johnd
,
will upload a file using a path like /inbox/REPORT_123.PDF
,
this is the virtual path.
On the server, that file is actually stored as
/storage/users/john/inbox/REPORT_123.PDF
,
this is the real path.
The dispatch_attribute should be configured with the real path of the file
to be handled.
Using a virtual path will cause the file not to be dispatched appropriately as
it will use the virtual (non-existent) path as the source.
The file dispatching will fail if the event does not have the configured attribute.
Note
Please make sure that you are not attaching this event handler to file events for which the path was not yet closed. In such scenarios the move operation will fail.
When the event handler starts, it will try to validate the current configuration. The event handler will fail to start when the configuration is invalid.
Some configuration options can only be validated at execution time, at the moment when a certain action is triggered for an event. In such cases the event handling will fail and the failure count will be increased. Such a condition can be found when configuring the dispatch handler for an event ID which has no associated paths.
13.6.3. Matching expressions¶
The dispatch_rules will have a matching expression defined as the second member of the configuration.
This is used to decide the paths for which the action is triggered.
The matching expression is checked against the full path of the file.
As an example, if you want to match files based on the naming convention INV_NNNN.csv (like INV_0023.csv or INV_1202.csv), located in any directory, use a configuration similar to:
[event-handlers/a0cf-d14500774340]
dispatch_rules = move, */INV_*.csv, /srv/teams/invoicing
When you want to match only files inside a certain folder,
make sure to include that folder as part of the matching expression.
For example, a file named REPORT_123.csv
will be moved to
/srv/teams/invoicing
when placed into the source folder /invoicing
and
moved to /srv/teams/ops
when placed into the source folder
/operations
:
[event-handlers/f040be6a-e158]
dispatch_rules =
move, /invoicing/REPORT_*.csv, /srv/teams/invoicing
move, /operations/REPORT_*.csv, /srv/teams/ops
Note
The fallback_rule has no configuration for the matching expression since this is used for files which are not matching any of the configured expressions.
For more details about the available expressions see the matching expression documentation.
13.6.4. Regular Expressions and Grouping¶
When regular expressions are defined for the path matching expression, you can use the parentheses for grouping parts of the source path which, once matched, can be used to dynamically define the destination path based on the source path.
Note
The exclusion regular expression mode (e/some/.*.pdf
) is not supported
for group substitution.
Note
The group substitution is not available for the fallback rule as there is no matching expression for that configuration.
For the purpose of this section, it is assumed that you are already familiar with the regular expression. Here you can read more about the regular expressions.
For example, to have a file which is pushed to a path
/inbox/accounting/john/SRV_123.PDF
and moved to the path
/reports/john/teams/accounting/SRV_123.PDF
use the following configuration:
[event-handlers/f040be6a-e158]
dispatch_rules =
move, m/inbox/(.+)/(.+)/SRV_.+\.PDF/, /reports/{2}/teams/{1}/
The first pair of parentheses will match the source team name and the second pair will match the source username. You can then used the matched valued as {1} or {2} etc to generate the destination path.
The first group to be matched is {1}.
The group matching can be used the dynamically generate the destination directory or the generate the full destination path.
When groups are used in the configured destination and the configuration does not end with a path separator, it will use the configuration to generate the full path for the destination. Otherwise, it will dynamically generate only the destination directory and use the source filename to generate the full path.
For example, to have a file which is pushed to a path
/users/john/SRV_123.PDF
and moved to the path
/staging/john-SRV_123.PDF.in
use the following configuration:
[event-handlers/f040be6a-e158]
dispatch_rules =
move, m/users/(.+)/(SRV_.+\.PDF)/, /staging/{1}-{2}.in
To let SFTPPlus generate the destination path and for example have the file
pushed as /user/john/SRV_123.PDF
moved to the path
/staging/john/SRV_123.PDF
, use the following configuration:
[event-handlers/f040be6a-e158]
dispatch_rules =
move, m/users/(.+)/SRV_.+\.PDF/, /staging/{1}/
13.6.5. Destination path variables¶
The destination path can be defined to include values from a set of dynamic variables based on each event.
Assume you have 2 accounts named john-d
and jane-r
with home
directories in C:\SFTP-In\john-d
and C:\SFTP-In\jane-r
respectively.
When john-d
or jane-r
uploads a file named /reports.xml
inside their root folder, the actual file is received on disk as
C:\SFTP-In\john-d\app\report.xml
or
C:\SFTP-In\jane-r\app\report.xml
.
You might need to move both reports into a common internal application
inbox directory with file names like
C:\App-In\john-d_report.xml
or C:\App-In\jane-r_report.xml
.
This can be done using the following configuration, in which only the files placed inside the app sub-directories are moved:
[event-handlers/f040be6a-e158]
dispatch_rules =
move, g/C:\SFTP-In\*\app\*/, C:\App-In\{account.name}-{data.file_name}
You can also define the following configuration, in which any file uploaded by a user is moved:
[event-handlers/f040be6a-e158]
dispatch_rules =
move, g/.*/, C:\App-In\{account.name}-{data.file_name}
or defined using regular expression syntax with matching groups enabled:
[event-handlers/f040be6a-e158]
dispatch_rules =
move, m/(.*)/i, C:\App-In\{account.name}-{data.file_name}
Note
The usage of explicit g/EXPRESSION/ globbing matching syntax or regular expression with groups is required for destination path variables.
This is required for backward compatibility with older configurations in which the destination configuration was defining only the base destination directory.
13.6.6. Destination path normalization¶
The destination paths are normalized and any forward slash or backward slash is automatically converted to the path delimiter used by the operating system hosting the STPPlus application.
For example if you have the following configuration on a Linux system,
the destination path is /staging/john/SRV_123.PDF
:
[event-handlers/f040be6a-e158]
dispatch_rules =
move, m/users/(.+)/SRV_.+\.PDF/, \staging\{1}\
While with the following example on Windows
the destination path is C:\staging\john/SRV_123.PDF
:
[event-handlers/f040be6a-e158]
dispatch_rules =
move, m/C:\\users\\(.+)/SRV_.+\.PDF/, c:/staging/{1}/
13.6.7. Advanced Regular Expressions¶
Using the matching_expressions configuration, you can create a destination path using multiple data sources, not only the path of the handled file.
At the same time, you can transform the values to uppercase or lowercase.
As an example, suppose we handle ZIP archives copied to /inbox/AcmeCo-reports.zip containing files /sales.csv and /returns.csv. The system is configured to automatically extract the files to the /storage/received/ path. Upon normal operation, this would result in /received/sales.csv and /storage/received/returns.pdf.
However, if you want to have these files extracted as /received/sales-AcmeCo.csv and /received/returns-AcmeCo.pdf, you can use the following configuration::
matching_expressions:
source_path, m//inbox/(\d{6})-.+.zip/i
dispatch_rules:
rename, m/.+/(.+\)\.([a-z])/i, /received/{1}-{source_path.1}.{2}
Using character case transform operations, you could have the files extracted as /received/sales-ACMECO.csv and /received/returns-ACMECO.pdf with the following configuration:
matching_expressions:
source_path, m//inbox/(\d{6})-.+.zip/i
dispatch_rules:
rename, m/.+/(.+\)\.([a-z])/i, /received/{1}-{source_path.1_upper}.{2}
13.6.8. Triggering the Dispatch Rule¶
For the dispatch_rules configuration option you can specify multiple rules, one per line.
Multiple rules are checked in the order of their definition.
The dispatch will stop after the first rule which matches the current file path.
When none of the configured path matching expression could match the path associated with the handled event, the fallback rule is used.
When no fallback rule is defined and none of the configured expression matches the path no action is taken by the event handler.
13.6.9. Events with multiple paths¶
Some events emitted by SFTPPlus have associated multiple paths.
You can still use the file dispatcher together with those events.
The actions are executed for each of the associated files.
The event handling operation is aborted when failing to perform the configured action for one of the associated files. In this case the remaining files are not actioned.
13.6.10. File Overwrite Prevention¶
The file dispatcher will not overwrite existing file and the whole dispatch process will fail if one of the configured destinations already contains a file with the same name.
13.6.11. Handling of temporary / transient files¶
It is common to have an external client or process pushing a file using a temporary name and rename to the final name once the transfer is complete.
In this case, you can use the dispatch_delay option to configure the file dispatcher to execute the dispatching with a delay. This will allow the file to be created with a final name.
13.6.12. Retry on errors¶
The file dispatcher can be configured to retry the operation on error.
It can wait a configurable number of seconds before retrying.
13.6.13. Automatically creating destination folders¶
The default behaviour of the file dispatcher event handler is to fail when the destination does not exist.
The create_destination_folder configuration option can be used to automatically create the destination.
To create the parent for the destination folder, set the event handler as the following example. To prevent accidental configurations, the event handler will still fail if the path to the parent folder does not exist:
[event-handlers/f040be6a-e158]
create_destination_folder = parent
13.6.14. Copy Action¶
The copy action will create copies of the source file in each of the configured destinations.
If for any reason the file fails to be successfully copied to any of the destination, the process of copying to the remaining destination is aborted.
For example, to have a file named RPT_123.xml copied to both
/data/teams/invoicing/RPT_123.xml
and /data/teams/ops/RPT_123.xml
,
use the following configuration:
[event-handlers/f040be6a-e158]
dispatch_rules =
copy, */rpt_*.xml, /srv/teams/invoicing, /srv/teams/ops
13.6.15. Move Action¶
The move action will create copies of the the source file in each of the configured destinations.
Once the file is copied to all the destinations, it will delete the source file. This is the reason why it is named move.
If for any reason the file fails to be successfully copied to any of the destination, the source file is not removed.
For example, to have a file named RPT_123.xml copied to both
/data/teams/invoicing/RPT_123.xml
and /data/teams/ops/RPT_123.xml
and
then removed from the source, use the following configuration:
[event-handlers/f040be6a-e158]
dispatch_rules =
move, */rpt_*.xml, /srv/teams/invoicing, /srv/teams/ops
13.6.16. Move With Timestamp Action¶
The move-with-timestamp is similar to the move action but files are created in the destination with a timestamp inserted at the end of the filename but before the file extension.
When the file has no extension, the timestamp is suffixed.
This action is useful to dispatch files and mitigate the risk of overwriting existing files.
The timestamp has a sub-seconds resolution. The resolution is depended on the host operating system.
Besides the sub-second resolution, the timestamp includes a random number so that even when you have multiple files generated in the same sub-second, they will still have different timestamps.
For example, to have a file named RPT_123.xml copied to both
/data/teams/invoicing/RPT_123.2013-02-24-16-50-43-983422-042.xml
and
/data/teams/ops/RPT_123.2013-02-24-16-50-43-983422-042.xml
and
then removed from the source, use the following configuration:
[event-handlers/f040be6a-e158]
dispatch_rules =
move-with-timestamp, */rpt_*.xml, /srv/teams/invoicing, /srv/teams/ops
13.6.17. Rename Action¶
The rename action does a rename of source path to destination path.
The rename is done on the full path. You can use it to move a file from one path to another.
It is called renamed, since it can only be used with a single destination. This is done in order to differentiate it from our other move actions, which are performed on multiple destinations.
As long as the source and destination are on the same filesystem, the operation will be atomic and instant.
The operation will fail if the destination already exists.
For example, to have a file named RPT_123.xml
pushed to /inbox/reports
folder,
renamed to /data/teams/ops/RPT_123.xml
,
use the following configuration:
[event-handlers/f040be6a-e158]
dispatch_rules =
rename, /inbox/reports/rpt_*.xml, /srv/teams/ops
13.6.18. Rename with Unix Timestamp Prefix Action¶
The rename-prepend-unixtime action does a rename of source path to destination path.
The renaming is done with prepending the Unix timestamp (with milliseconds) to the source filename.
For example, to have a file named RPT_123.xml
pushed to /inbox/reports
folder,
renamed to /data/teams/ops/0031550404.009876-RPT_123.xml
,
use the following configuration:
[event-handlers/f040be6a-e158]
dispatch_rules =
rename-prepend-unixtime, /inbox/reports/rpt_*.xml, /srv/teams/ops
13.6.19. Delete Action¶
The delete action will delete the source path. It does not require any configured destinations.
13.6.20. Ignore Operation Action¶
The ignore action will not perform any action on the path. It does not require any configured destinations.
This action can be used to implement exceptions from a more generic rule that might be configured in the same event handler.
For example, to have any XML file moved to another directory, with the exception of XML files starting with test- you can use something similar to following configuration:
[event-handlers/f040be6a-e158]
dispatch_rules =
ignore, /import/test-*.xml
move, /inbox/*.xml, /srv/db-app/import
13.6.21. Execute Action¶
The execute action allows you to call an external executable or script and pass a set of arguments that most often will include the source file path.
This action is useful to reuse the file name matching selection present in the dispatch files and integrate it with custom external actions.
It is configured as a comma separated list of values, to make it easy to handle spaces in the executable source path or the targeted file.
The first value is the path to the executable or script that is invoked for the configured file pattern.
The remaining values are arguments used when calling the executable. Remember to add commas between arguments.
For example, we move a local file to an S3 bucket using the AWS CLI. It assumes that you are running SFTPPlus in an EC2 for which the S3 bucket role was enabled at the instance level:
[event-handlers/f040be6a-e158]
target = 40017
executable_timeout: 60
dispatch_rules =
execute, *.csv, /bin/aws, s3, move, {data.real_path}, https://test-bucket/{data.file_name}, --acl=private
It expects the execute command to finalize after executable_timeout seconds and to exit with code 0.
Any other exit code is considered an error and it will retry to process the file based on the configured retry rules.
The following environment variables are set when calling the external executable:
SOURCE_PATH - path to the file that is dispatched
ARGUMENTS - space separated list of configured arguments for the dispatcher