This topic covers the Diff tool, which compares saved data with incoming data and reports differences in SOAtest and Virtualize.Sections include:
Understanding Diff
With support for Binary, Text, XML, and JSON, the Diff tool is the current backbone of a best-practices regression testing solution.The Diff tool compares the data in its editor against the data that it receives from another tool's output or from another input that you specify and reports a failure when it finds a difference.
The Diff Project Configuration panel is divided into two main sections. The upper section contains general options. The bottom section contains the Regression Control, Ignored Differences, and Options tabs in which you can configure the options for comparing data in text, binary, JSON, and XML formats.
As with other tools, you can specify a data source for the tool if a source was added to the project. The data source for the Diff tool specifies the Data Source to be used for providing control values. For more information on Data Sources, see Parameterizing Tests with Data Sources, Variables, or Values from Other Tests.
Binary Diff Mode
Choose Binary from the Diff Mode menu to view differences for binary files.
Regression Control
This setting determines what data source values, file, or text is used as the "control" value (the value against which it will compare all subsequent results). Choose one of the following options.
Editor: Choose this option from the menu and enter text you want to use as a regression control in the related field.
File: Choose this option from the menu and specify a file to use as a regression control. If you want to ensure that this file's path is always relative to your project file, enable the Persist as Relative Path option.
Data Source Column: Choose this option from the menu and specify a value from the designated data source column as regression controls. This option is only available if your project includes a data source.
Text Diff Mode
Choose Text from the Diff Mode menu to access the following configuration options for this mode.
Regression Controls
Determines what data source values, file, or text is used as the "control" value (the value against which it will compare all subsequent results). Choose one of the following options.
Editor: Choose this option from the menu and enter text you want to use as a regression control in the related field.
File: Choose this option from the menu and specify a file to use as a regression control. If you want to ensure that this file's path is always relative to your project file, enable Persist as Relative Path.
Data Source Column: Choose this option from the menu and specify a value from the designated data source column as regression controls. This option is only available if your project includes a data source.
Ignored Differences
From the Ignored Differences tab of the Diff tool for Text mode, you can delete any ignored text differences by clicking the Delete button.
Options
The following options are available in the Options tab of the Diff tool for Text mode.
- Regular Expression: Determines whether the expected value will be parsed as a regular expression. If this option is not enabled, the control value will not be parsed as a regular expression. If this option is enabled, the control value will be parsed as a regular expression.
For example, the following regular expression may be entered:Java[a-zA-Z ]+\Q(\E[4-9]+th Edition\Q)\E
Where the actual string extracted may be the following: Java How to Program (4th Edition)
The regular expression will parse through the above string to check if "Java" appears as the first substring, followed by one or more occurrences (indicated by the + operator after the open and closed brackets) whereas the occurrences are limited to only letters ranging between a-z and A-Z and whitespace. Notice we escaped ( and ) by adding \Q and \E around ( and ). This needs to be done because open and closed parentheses can be part of a regular expression, therefore they need to be escaped if they are represented to be substrings. We also have a digit appear after the first open parenthesis which is indicated by [4-9]+. This specifies at least one occurrence of a digit between 4 and 9 that should appear and be followed by "th Edition". Lastly, we have the closed parenthesis which we escape with \Q\E. For more information regarding supported regular expression syntax, click here. - Ignore Whitespace: Determines whether empty lines and whitespaces at the end and beginning of input lines and diff control lines are ignored. If this option is not enabled, empty lines and leading/trailing whitespaces will cause the regression test to fail. If this option is enabled, empty lines and leading/trailing whitespaces will be ignored.
A whitespace is any of the following: horizontal tabulation, new line, form feed, carriage return, space. An empty line is a line that contains one or more of these whitespaces. - Output results as UNIX-style diff: Determines the diff output format. If this option is not enabled, the output will display in table form. If this option is enabled, the output will display in UNIX-style.
XML Diff Mode
Choose XML diff from the Diff Mode menu to access options for XML formats. XML mode parses XML files and compares them element by element and attribute by attribute (ignoring “ignorable” whitespace). The result of the comparison is expressed as an XML document. If you want to use this mode, make sure that both inputs are well-formed XML documents which have the same type of document element.
Regression Control
Determines what data source values, file, or text is used as the "control" value (the value against which it will compare all subsequent results). The Regression Control settings include two graphical representations to configure XML messages.
Literal XML Regression Controls
The following options are available in the Regression Control tab of the Diff tool for Literal XML mode.
Editor: Choose this option from the menu and enter text you want to use as a regression control in the related field.
File: Choose this option from the menu and specify a file to use as a regression control. If you want to ensure that this file's path is always relative to your project file, enable Persist as Relative Path.
Data Source Column: Choose this option from the menu and specify a value from the designated data source column as regression controls. This option is only available if your project includes a data source.
- Set From WSDL: Initializes the Form XML content with the expected response based on the WSDL. This button is only available if a WSDL document exists for the particular SOAP client.
Form XML Regression Controls
When Form XML is selected as the regression control mode, the options in the Regression Control tab are divided into an XML View tree and XML configuration tabs.
The XML tree displays the literal XML as a tree with each tree node representing an element. The options for the tree in the Diff panel can be configured in the same fashion as the Form XML SOAP Envelope options of the SOAP Client tool. For more information, see Manipulating the XML View Tree.
The XML Configuration Tabs allow you to add, remove, and rename XML components. The options for the XML Configuration Tabs can be configured in the same fashion as the Form XML SOAP Envelope options of the SOAP Client tool. For more information, see Manipulating the XML Configuration Tabs.
XML Ignored Differences
From the Ignored Differences tab of the Diff tool for XML mode, you can Add and Modify XPath settings by clicking the appropriate buttons. For more information on configuring Ignored Differences, see Understanding XPaths.
XML Options
The following options are available in the Options tab of the Diff tool for XML mode.
- Diff engine: Choose a diff engine from the menu. XMLUnit is the default engine. This engine provides support for large messages, comparisons that generate a large number of differences, and the ability to ignore changes in element order. If desired, you can switch to VMTools.
- Ignore element order: Configures the tool to ignore element order throughout all areas of every processed document. Note that if you’d rather ignore order for only a specific part of the document, use the XML Sort tool to sort that part before passing the document to the Diff tool. See XML Sort for details.
- SOAP mode: If this option is enabled, the following actions are enabled:
- SOAP multi-references are resolved before diffing and are not reported as errors. For example, some services (such as Axis) rearrange the XML responses in an unpredictable way when using SOAP multi-refs (that is, giving different id numbers to the references). When such responses are diffed in text mode, failures occur even though the SOAP messages are logically equivalent, but their references are different.
- Namespace prefix changes in type and arrayType attributes are ignored. This is needed when the control is generated automatically from the WSDL because prefixes cannot be determined in advance.
- Numerical values are compared as numbers. For example, the difference between 1 and 1.0 would not be reported as an error.
- Ignore differences in comments: Disable this option to include changes in XML comments. It is enabled by default.
Using VMTools
VMTools is built into the product. No additional license is needed. If you select VMTools, you can specify the following options:
Output results as XML: Determines the diff output format. If this option is not enabled, the output will display in table form. If this option is enabled, the output will display in XML. Differences are only reflected when another tool is chained to the message output of the Diff tool.
- SOAP mode: If this option is enabled, the following actions are enabled:
- SOAP multi-references are resolved before diffing and are not reported as errors. For example, some services (such as Axis) rearrange the XML responses in an unpredictable way when using SOAP multirefs (that is, giving different id numbers to the references). When such responses are diffed in text mode, failures occur even though the SOAP messages are logically equivalent, but their references are different.
- Namespace prefix changes in type and arrayType attributes are ignored. This is needed when the control is generated automatically from the WSDL because prefixes cannot be determined in advance.
- Numerical values are compared as numbers. For example, the difference between 1 and 1.0 would not be reported as an error.
Under certain conditions, VMTools will fall back to a text-based diff. This can happen if the XML is very complex and produces an excessive number of differences or would take an excessive amount of time to complete.
If this occurs, one of the following messages will be printed to the Console view to explain what happened.
Did not run XML Diff, there are too many differences. Performing text mode diffing instead.
Did not run XML Diff, it did not finish within 30.0 secs. Performing text mode diffing instead.
The maximum time allowed to perform an XML diff before switching to text mode is the default timeout value specified in the preferences (go to Parasoft > Preferences > Misc and see the Default timeout (milliseconds) setting), but no higher than 120 seconds.
JSON Diff Mode Options
When JSON is selected from the Diff Mode menu of the Diff tool, the Regression Control and Ignored Differences options are available.
Regression Control
The following options are available in the Regression Control tab of the Diff tool for JSON mode.
- Name: Specifies the name of the Diff tool.
Regression Control Source: Determines what data source values, file, or text is used as the "control" value (the value against which it will compare all subsequent results).
Editor: Choose this option from the menu and enter text you want to use as a regression control in the related field.
File: Choose this option from the menu and specify a file to use as a regression control. If you want to ensure that this file's path is always relative to your project file, enable Persist as Relative Path.
Data Source Column: Choose this option from the menu and specify a value from the designated data source column as regression controls. This option is only available if your project includes a data source.
Ignored Differences
From the Ignored Differences tab of the Diff tool for JSON mode, you can Add and Delete XPath settings by clicking the appropriate buttons.
Understanding XPaths
When the Diff tool is configured in the XML mode, any differences found between the actual value and the expected value are expressed in XPath. XPaths represent the position of an XML element and specifies where a difference occurred, the type of difference that occurred, and whether that difference was caused by a modification, an insertion, or a deletion.
You can determine which XPaths to ignore when running the Diff tool. You can choose to either ignore an entire XPath, or you can ignore a specific XPath operation such as a Content: Insert
operation.
Ignoring XPaths is useful for ignoring transient values that would normally cause the regression test to fail. For example, the actual outcome may contain a timestamp value that is constantly changing. Since this value is never the same, it will more than likely not match the value you configure in the Diff tool, causing the test to fail. Therefore, you can specify the XPath of this value to be ignored so that the test will not fail.
There are three different ways you can set XPaths to Ignore:
- Ignore XPaths from the Quality Tasks view: This is the easiest way to configure the Diff tool to ignore XPaths. After a failed regression test, you can simply right-click the Quality Tasks view node to choose an XPath to ignore.
- Manually entering an XPath in the Diff GUI: You can manually enter an XPath by clicking Configure Ignored Differences in the Diff GUI.
- Ignore XPaths from the Form XML tree: You can right-click an element node from the Form XML tree and configure an XPath to ignore based on the selected element.
Ignoring XPaths from the Quality Tasks View
When entering XPaths to ignore in the Diff tool, it is easier to right-click in the Quality Tasks view than manually entering the XPath position into the Diff tool GUI. XPath positions are displayed in tree form in the Quality Tasks view after a failed regression test:
To ignore an XPath from the Quality Tasks view after a failed regression test:
- Right-click the error and choose Ignore XPath. An Ignored XPath Settings dialog opens with the selected XPath automatically entered.
- Enable the options related to the XPath operations you would like to ignore. The following options are available:
- XPath: Specifies the XPath position that you selected.
- Recursive: Select to apply the Ignored XPath settings to child elements.
- Text Content (Modify/Insert/Delete): Select the content operation you want to ignore.
- Element/Subtree (Insert/Delete): Select the element or subtree operation you want to ignore.
- Attribute (Modify/Insert/Delete): Select the attribute operation you want to ignore. If this option is enabled, SOAtest will only ignore the specified attribute name within the XPath. To ignore a specific attribute, enter the attribute name in the field next to the Attribute option. If you want to ignore more than one attribute at an element’s XPath location, leave the attribute name empty or use the wild card * (for example, myAttribute*).
- Element Name and Namespace (Modify/Insert/Delete): Select the element name operation you want to ignore.
- Click OK.
The XPath you specified will now be ignored for any future test runs. In addition, the XPath you specified now appears in the Ignored Differences tab of the XML Mode in the Diff tool. To modify the XPath, see Modifying XPath Settings.
Manually Entering XPaths to Ignore
You can also manually type or paste an XPath to ignore into the Diff tool configuration panel.
To manually enter an XPath:
- Select the Ignored Differences tab within the Text, XML, or JSON mode of the Diff tool.
- Click Add. An empty field appears in the XPath column of the Ignored XPaths list. By default, the Settings column is automatically filled in with all XPath operations specified, meaning that the entire XPath you add will be ignored. To specify a single XPath operation to ignore, see Ignoring XPaths from the Form XML Tree.
- Enter an XPath position in the empty XPath field.
- To add additional XPaths, repeat steps 1 through 3.
- Click OK.
All XPaths added will be ignored in future runs of the modified regression test. For information on modifying ignored XPaths, see Modifying XPath Settings.
Ignoring XPaths from the Form XML Tree
You may also ignore XPaths directly from the Form XML tree within the Form XML tab by completing the following:
- Right-click an element node from the tree and choose Setup Ignored XPaths.
- If the element you selected from the Form XML tree is not a repeated element, the Ignored XPaths Settings dialog box opens with the selected XPath automatically entered.
- If the element you selected from the Form XML tree is a repeated element that has siblings with the same local name, a Repeated XPath Settings dialog displays.
You can either select to Ignore All of the repeated elements or Ignore Selected Element Only. After making your selection and clicking OK, the Ignored XPaths Settings dialog box displays with the selected XPath automatically entered.
- If the element you selected from the Form XML tree is not a repeated element, the Ignored XPaths Settings dialog box opens with the selected XPath automatically entered.
- Enable the appropriate options of the XPath operations you would like to ignore. The following options are available:
- XPath: Specifies the XPath position that you selected.
- Recursive: Select to apply the Ignored XPath settings to child elements.
- Text Content (Modify/Insert/Delete): Select the content operation you want to ignore.
- Element/Subtree (Insert/Delete): Select the element or subtree operation you want to ignore.
Attribute (Modify/Insert/Delete): Select the attribute operation you want to ignore. If this option is enabled, only the specified attribute name within the XPath will be ignored. To ignore a specific attribute, enter the attribute name in the field next to the Attribute option. If you want to ignore more than one attribute at an element’s XPath location, leave the attribute name empty or use the wild card * (for example, myAttribute*).
- Element Name and Namespace (Modify/Insert/Delete): Select the element name operation you want to ignore.
- Click OK.
The selected element node now displays in gray in the Form XML tree. All XPath operations specified will be ignored in future runs of the modified regression test.
To reset and no longer Ignore an XPath, right-click the Ignored XPath node and choose Remove Ignored XPath. The node will then display in black and will no longer be ignored.
Creating a Shared Ignored XPaths List
Diff tools can either have their own local, exclusive Ignored XPath settings, or they can reference a list of XPaths that can be used by other Diff tools.
Modifying XPath Settings
By default, when you add an XPath to the Ignored Differences tab, all of the possible operations for that XPath appear in the Settings column. Since all of the possible operations for the XPath are specified by default, all of these operations will be ignored the next time the regression test is run. However, you can specify which operations are ignored, rather than having all of them ignored, by modifying the XPath settings.
To modify XPath settings:
- In the Ignored Differences tab, select the XPath you would like to modify and click Modify. The Ignored XPaths Settings dialog box opens.
- Enable the options related to the XPath operations you would like to ignore. The following options are available:
- Recursive: Select to apply the Ignored XPath settings to child elements.
- XPath: Specifies the XPath position that you selected.
- Text Content (Modify/Insert/Delete): Select the content operation you want to ignore.
- Element/Subtree (Insert/Delete): Select the element or subtree operation you want to ignore.
Attribute (Modify/Insert/Delete): Select the attribute operation you want to ignore. If this option is enabled, only the specified attribute name within the XPath will be ignored. To ignore a specific attribute, enter the attribute name in the field next to the Attribute option. If you want to ignore more than one attribute at an element’s XPath location, leave the attribute name empty or use the wild card * (for example, myAttribute*).
- Element Name and Namespace (Modify/Insert/Delete): Select the element name operation you want to ignore.
- Click OK.
All XPath operations specified will be ignored in future runs of the modified regression test.
Video Tutorial
In this video, you'll learn how to generate and update regression controls.