PMML 4.0 - Changes from PMML 3.2
Includes changes from 4.0.0 to 4.0.1 which are noted by [4.0.1]
Note: 4.0.1 was a “service pack” release to fix some problems and
inconsistencies with the original 4.0.0 release. Since 4.0.1 was not a
major release, and since it replaced the original 4.0.0 release, it is
the 4.0 version.
Associations
- Itemset and AssociationRule elements are no longer enclosed within a
"Choice" element
- Added different scoring procedures: recommendation,
exclusiveRecommendation and ruleAssociation with explanation and
example
- Changed version to "4.0" from "3.2" in the example(s)
BuiltinFunctions
- Added functions for boolean operations: isMissing, isNotMissing,
equal, notEqual, lessThan, lessOrEqual, greaterThan, greaterOrEqual,
and, or, not, isIn, isNotIn
- [4.0.1] Added if-then-else construct
- [4.0.1] Deleted extra comma in first paragraph
- [4.0.1] Added an extra line after table header for pseudo-declarations
and examples for new built-in functions so that they look exactly as all other functions.
- [4.0.1] Deleted the "s" in word "Returns" to "Return" for new built-in functions
isMissing,isNotMissing and equal,notEqual,...greaterOrEqual. More than
one function, verb conjugation was wrong before.
- [4.0.1] Added "/" to pseudo declaration PMML for the following built-in
functions: +, log10, pow, floor, isIn, uppercase, trimBlanks,
formatNumber,formatDatetime. As it was, pseudo was wrong.
- [4.0.1] Deleted extra space for pseudo declaration for function lessThan.
- [4.0.1] Deleted extras "/" from example for function substring - .
As it was, example was wrong.
- [4.0.1] Removed link for "Posix datetime descriptors" in function formatDateTime since it was broken.
ClusteringModel
- Changed version to "4.0" from "3.2" in the example(s)
- Added reference to ModelExplanation element in the model XSD
Conformance
- Changed all version references from "3.2" to "4.0"
DataDictionary
Functions
GeneralRegression
- Suggested changes to allow Cox survival models and model ensembles
- Add new model type: CoxRegression.
- Allow empty regression model when model type is CoxRegression, so that
baseline-only model could be represented.
- Add new optional model attributes: endTimeVariable, startTimeVariable,
subjectIDVariable, statusVariable, baselineStrataVariable, modelDF.
- Add optional Matrix in Predictor to specify a contrast matrix, optional
attribute referencePoint in Parameter.
- Add new elements: BaseCumHazardTables, EventValues, BaselineStratum,
BaselineCell.
- Add examples of scoring for Cox Regression and contrast matrices.
- Add new type of distribution: tweedie.
- Add new attribute in model: targetReferenceCategory, so that the model can
be used in MiningModel.
- Changed version to "4.0" from "3.2" in the example(s)
- Added reference to ModelExplanation element in the model XSD
GeneralStructure
- Updated the XML namespace to "https://www.dmg.org/PMML-4_0"
from "https://www.dmg.org/PMML-3_2"
- Added TimeSeriesModel to the PMML XSD
Header
Interoperability
- Changed: "As a result, a new approach for interoperability
was required and is being introduced in PMML version 3.2." to
"As a result, a new approach for interoperability was
introduced in PMML version 3.2."
MiningSchema
- Added frequencyWeight and analysisWeight as new options
for usageType. They will not affect scoring, but will make model
information more complete.
- [4.0.1] Added the text to clarify TransformationDictionary and MiningSchema in V4.0
which was approved by DMG after many discussions.
ModelComposition -- No longer used, replaced by MultipleModels
ModelExplanation
- New addition to PMML 4.0 that contains information to explain the
models, model fit statistics, and visualization information.
- [4.0.1] Changed ModelExplanation element to allow multiple PredictiveModelQuality
and ClusteringModelQuality elements.
ModelVerification
MultipleModels
- Replaces ModelComposition. Important additions are segmentation and
ensembles.
- Added reference to ModelExplanation element in the model XSD
- [4.0.1] Added the missing MiningSchemas in the models inside Segments
- [4.0.1] Replaced "DecisionTree" type with the correct "TreeModel".
- [4.0.1] Renamed embedded space in variable name with with an underscore, e.g.,
"sepal width" to "sepal_width".
NaïveBayes
- Changed version to "4.0" from "3.2" in the example(s)
- Added reference to ModelExplanation element in the model XSD
NeuralNetwork
- Changed version to "4.0" from "3.2" in the example(s)
- Added reference to ModelExplanation element in the model XSD
Output
- Extended output type to include Association rule models. The changes
add a number of new attributes: "ruleFeature", "algorithm",
"rank", "rankBasis", "rankOrder" and "isMultiValued". A new enumeration
type "ruleValue" is added to the RESULT-FEATURE
- [4.0.1] Removed the unused SelectionResult XSD in the comments which was not
being stripped out correctly.
Regression
- Changed version to "4.0" from "3.2" in the example(s)
- Added reference to ModelExplanation element in the model XSD
RuleSet
- Changed version to "4.0" from "3.2" in the example(s)
- Added reference to ModelExplanation element in the model XSD
Sequence
- Changed version to "4.0" from "3.2" in the example(s)
Statistics
- accommodate weighted counts by replacing INT-ARRAY with NUM-ARRAY
in DiscrStats and ContStats
- change xs:nonNegativeInteger to xs:double in several
places
- add new boolean attribute 'weighted' to UnivariateStats and PartitionFieldStats
elements
- add new attribute cardinality in Counts
- Also some very long lines in this document are now wrapped.
SupportVectorMachine
- Added optional attribute threshold
- Added optional attribute classificationMethod
- Attribute alternateTargetCategory removed from
SupportVectorMachineModel element and moved to
SupportVectorMachine element
- Changed the example slightly
- Changed version to "4.0" from "3.2" in the example(s)
- Added reference to ModelExplanation element in the model XSD
Targets
Taxonomy
- Changed: "A TableLocator may contain any description which helps
an application to locate a certain table. PMML 3.2 does not yet
define the content. PMML users have to use their own extensions.
The same applies to InlineTable." to "A TableLocator may contain any
description which helps an application to locate a certain table.
PMML standard does not yet define the content. PMML users have
to use their own extensions. The same applies to InlineTable."
Text
- Changed version to "4.0" from "3.2" in the example(s)
- Added reference to ModelExplanation element in the model XSD
- [4.0.1] Changed sentence: "id is used for later referenced towards the
document" to "id is used for a later reference towards the document".
- [4.0.1] Changed attribute "numberOfFields" in example from 1000 to 6.
TimeSeriesModel
- New addition to PMML 4.0 to support Time series models
- [4.0.1] Corrected the errors in the example and the text.
Transformations
- [4.0.1] Added the text to clarify TransformationDictionary and MiningSchema in V4.0
which was approved by DMG after many discussions.
TreeModel
- Changed version to "4.0" from "3.2" in the example(s)
- Added reference to ModelExplanation element in the model XSD
- [4.0.1] Corrected the example (V4.0 Bugs item #2880206)