Changelog
#
Develop#
0.13.33- [FEATURE] Add optional ge_cloud_mode flag to DataContext to enable use with Great Expectations Cloud.
- [FEATURE] Rendered Data Doc JSONs can be uploaded and retrieved from GE Cloud
- [FEATURE] Implement InferredAssetAzureDataConnector with Support for Pandas and Spark Execution Engines (#3372)
- [FEATURE] Spark connecting to Google Cloud Storage (#3365)
- [FEATURE] SparkDFExecutionEngine can load data accessed by ConfiguredAssetAzureDataConnector (integration tests are included). (#3345)
- [FEATURE][MER-293] GE Cloud Mode for DataContext (#3262) (Thanks @roblim)
- [BUGFIX] Allow for RuntimeDataConnector to accept custom query while suppressing temp table creation (#3335) (Thanks @NathanFarmer)
- [BUGFIX] Fix issue where multiple validators reused the same execution engine, causing a conflict in active batch (GE-3168) (#3222) (Thanks @jcampbell)
- [BUGFIX] Run batch_request dictionary through util function convert_to_json_serializable (#3349) (Thanks @NathanFarmer)
- [BUGFIX] added casting of numeric value to fix redshift issue #3293 (#3338) (Thanks @sariabod)
- [DOCS] Docusaurus - How to connect to an MSSQL database (#3353) (Thanks @NathanFarmer)
- [DOCS] GREAT-195 Docs remove all stubs and links to them (#3363)
- [MAINTENANCE] Update azure-pipelines-docs-integration.yml for Azure Pipelines
- [MAINTENANCE] Update implemented_expectations.md (#3351) (Thanks @spencerhardwick)
- [MAINTENANCE] Updating to reflect current Expectation dev state (#3348) (Thanks @spencerhardwick)
- [MAINTENANCE] docs: Clean up Docusaurus refs (#3371)
#
0.13.32- [FEATURE] Add Performance Benchmarks Using BigQuery. (Thanks @jdimatteo)
- [WIP][FEATURE] add backend args to run_diagnostics (#3257) (Thanks @edjoesu)
- [BUGFIX] Addresses Issue 2937. (#3236) (Thanks @BenGale93)
- [BUGFIX] SQL dialect doesn't register for BigQuery for V2 (#3324)
- [DOCS] "How to connect to data on GCS using Pandas" (#3311)
- [MAINTENANCE] Add CODEOWNERS with a single check for sidebars.js (#3332)
- [MAINTENANCE] Fix incorrect DataConnector usage of _get_full_file_path() API method. (#3336)
- [MAINTENANCE] Make Pandas against S3 and GCS integration tests more robust by asserting on number of batches returned and row counts (#3341)
- [MAINTENANCE] Make integration tests of Pandas against Azure more robust. (#3339)
- [MAINTENANCE] Prepare AzureUrl to handle WASBS format (for Spark) (#3340)
- [MAINTENANCE] Renaming default_batch_identifier in examples #3334
- [MAINTENANCE] Tests for RuntimeDataConnector at DataContext-level (#3304)
- [MAINTENANCE] Tests for RuntimeDataConnector at DataContext-level (Spark and Pandas) (#3325)
- [MAINTENANCE] Tests for RuntimeDataConnector at Datasource-level (Spark and Pandas) (#3318)
- [MAINTENANCE] Various doc patches (#3326)
- [MAINTENANCE] clean up imports and method signatures (#3337)
#
0.13.31- [FEATURE] Enable
GCS DataConnector
integration withPandasExecutionEngine
(#3264) - [FEATURE] Enable column_pair expectations and tests for Spark (#3294)
- [FEATURE] Implement
InferredAssetGCSDataConnector
(#3284) - [FEATURE]/CHANGE run time format (#3272) (Thanks @serialbandicoot)
- [DOCS] Fix misc errors in "How to create renderers for Custom Expectations" (#3315)
- [DOCS] GDOC-217 remove stub links (#3314)
- [DOCS] Remove misc TODOs to tidy up docs (#3313)
- [DOCS] Standardize capitalization of various technologies in
docs
(#3312) - [DOCS] Fix broken link to Contributor docs (#3295) (Thanks @discdiver)
- [MAINTENANCE] Additional tests for RuntimeDataConnector at Datasource-level (query) (#3288)
- [MAINTENANCE] Update GCSStoreBackend + tests (#2630) (Thanks @hmandsager)
- [MAINTENANCE] Write integration/E2E tests for
ConfiguredAssetAzureDataConnector
(#3204) - [MAINTENANCE] Write integration/E2E tests for both
GCSDataConnectors
(#3301)
#
0.13.30- [FEATURE] Implement Spark Decorators and Helpers; Demonstrate on MulticolumnSumEqual Metric (#3289)
- [FEATURE] V3 implement expect_column_pair_values_to_be_in_set for SQL Alchemy execution engine (#3281)
- [FEATURE] Implement
ConfiguredAssetGCSDataConnector
(#3247) - [BUGFIX] Fix import issues around cloud providers (GCS/Azure/S3) (#3292)
- [MAINTENANCE] Add force_reuse_spark_context to DatasourceConfigSchema (#3126) (thanks @gipaetusb and @mbakunze)
#
0.13.29- [FEATURE] Implementation of the Metric "select_column_values.unique.within_record" for SQLAlchemyExecutionEngine (#3279)
- [FEATURE] V3 implement ColumnPairValuesInSet for SQL Alchemy execution engine (#3278)
- [FEATURE] Edtf with support levels (#2594) (thanks @mielvds)
- [FEATURE] V3 implement expect_column_pair_values_to_be_equal for SqlAlchemyExecutionEngine (#3267)
- [FEATURE] add expectation for discrete column entropy (#3049) (thanks @edjoesu)
- [FEATURE] Add SQLAlchemy Provider for the the column_pair_values.a_greater_than_b Metric (#3268)
- [FEATURE] Expectations tests for BigQuery backend (#3219) (Thanks @jdimatteo)
- [FEATURE] Add schema validation for different GCS auth methods (#3258)
- [FEATURE] V3 - Implement column_pair helpers/providers for SqlAlchemyExecutionEngine (#3256)
- [FEATURE] V3 implement expect_column_pair_values_to_be_equal expectation for PandasExecutionEngine (#3252)
- [FEATURE] GCS DataConnector schema validation (#3253)
- [FEATURE] Implementation of the "expect_select_column_values_to_be_unique_within_record" Expectation (#3251)
- [FEATURE] Implement the SelectColumnValuesUniqueWithinRecord metric (for PandasExecutionEngine) (#3250)
- [FEATURE] V3 - Implement ColumnPairValuesEqual for PandasExecutionEngine (#3243)
- [FEATURE] Set foundation for GCS DataConnectors (#3220)
- [FEATURE] Implement "expect_column_pair_values_to_be_in_set" expectation (support for PandasExecutionEngine) (#3242)
- [BUGFIX] Fix deprecation warning for importing from collections (#3228) (thanks @ismaildawoodjee)
- [DOCS] Document BigQuery test dataset configuration (#3273) (Thanks @jdimatteo)
- [DOCS] Syntax and Link (#3266)
- [DOCS] API Links and Supporting Docs (#3265)
- [DOCS] redir and search (#3249)
- [MAINTENANCE] Update azure-pipelines-docs-integration.yml to include env vars for Azure docs integration tests
- [MAINTENANCE] Allow Wrong ignore_row_if Directive from V2 with Deprecation Warning (#3274)
- [MAINTENANCE] Refactor test structure for "Connecting to your data" cloud provider integration tests (#3277)
- [MAINTENANCE] Make test method names consistent for Metrics tests (#3254)
- [MAINTENANCE] Allow
PandasExecutionEngine
to acceptAzure DataConnectors
(#3214) - [MAINTENANCE] Standardize Arguments to MetricConfiguration Constructor; Use {} instead of dict(). (#3246)
#
0.13.28- [FEATURE] Implement ColumnPairValuesInSet metric for PandasExecutionEngine
- [BUGFIX] Wrap optional azure imports in data_connector setup
#
0.13.27- [FEATURE] Accept row_condition (with condition_parser) and ignore_row_if parameters for expect_multicolumn_sum_to_equal (#3193)
- [FEATURE] ConfiguredAssetDataConnector for Azure Blob Storage (#3141)
- [FEATURE] Replace MetricFunctionTypes.IDENTITY domain type with convenience method get_domain_records() for SparkDFExecutionEngine (#3226)
- [FEATURE] Replace MetricFunctionTypes.IDENTITY domain type with convenience method get_domain_records() for SqlAlchemyExecutionEngine (#3215)
- [FEATURE] Replace MetricFunctionTypes.IDENTITY domain type with convenience method get_full_access_compute_domain() for PandasExecutionEngine (#3210)
- [FEATURE] Set foundation for Azure-related DataConnectors (#3188)
- [FEATURE] Update ExpectCompoundColumnsToBeUnique for V3 API (#3161)
- [BUGFIX] Fix incorrect schema validation for Azure data connectors (#3200)
- [BUGFIX] Fix incorrect usage of "all()" in the comparison of validation results when executing an Expectation (#3178)
- [BUGFIX] Fixes an error with expect_column_values_to_be_dateutil_parseable (#3190)
- [BUGFIX] Improve parsing of .ge_store_backend_id (#2952)
- [BUGFIX] Remove fixture parameterization for Cloud DBs (Snowflake and BigQuery) (#3182)
- [BUGFIX] Restore support for V2 API style custom expectation rendering (#3179) (Thanks @jdimatteo)
- [DOCS] Add
conda
as installation option in README (#3196) (Thanks @rpanai) - [DOCS] Standardize capitalization of "Python" in "Connecting to your data" section of new docs (#3209)
- [DOCS] Standardize capitalization of Spark in docs (#3198)
- [DOCS] Update BigQuery docs to clarify the use of temp tables (#3184)
- [DOCS] Create _redirects (#3192)
- [ENHANCEMENT] RuntimeDataConnector messaging is made more clear for
test_yaml_config()
(#3206) - [MAINTENANCE] Add
credentials
YAML key support forDataConnectors
(#3173) - [MAINTENANCE] Fix minor typo in S3 DataConnectors (#3194)
- [MAINTENANCE] Fix typos in argument names and types (#3207)
- [MAINTENANCE] Update changelog. (#3189)
- [MAINTENANCE] Update documentation. (#3203)
- [MAINTENANCE] Update validate_your_data.md (#3185)
- [MAINTENANCE] update tests across execution engines and clean up coding patterns (#3223)
#
0.13.26- [FEATURE] Enable BigQuery tests for Azure CI/CD (#3155)
- [FEATURE] Implement MulticolumnMapExpectation class (#3134)
- [FEATURE] Implement the MulticolumnSumEqual Metric for PandasExecutionEngine (#3130)
- [FEATURE] Support row_condition and ignore_row_if Directives Combined for PandasExecutionEngine (#3150)
- [FEATURE] Update ExpectMulticolumnSumToEqual for V3 API (#3136)
- [FEATURE] add python3.9 to python versions (#3143) (Thanks @dswalter)
- [FEATURE]/MER-16/MER-75/ADD_ROUTE_FOR_VALIDATION_RESULT (#3090) (Thanks @rreinoldsc)
- [BUGFIX] Enable
--v3-api suite edit
to proceed without selecting DataConnectors (#3165) - [BUGFIX] Fix error when
RuntimeBatchRequest
is passed toSimpleCheckpoint
withRuntimeDataConnector
(#3152) - [BUGFIX] allow reader_options in the CLI so can read
.csv.gz
files (#2695) (Thanks @luke321321) - [DOCS] Apply Docusaurus tabs to relevant pages in new docs
- [DOCS] Capitalize python to Python in docs (#3176)
- [DOCS] Improve Core Concepts - Expectation Concepts (#2831)
- [MAINTENANCE] Error messages must be friendly. (#3171)
- [MAINTENANCE] Implement the "compound_columns_unique" metric for PandasExecutionEngine (with a unit test). (#3159)
- [MAINTENANCE] Improve Coding Practices in "great_expectations/expectations/expectation.py" (#3151)
- [MAINTENANCE] Update test_script_runner.py (#3177)
#
0.13.25- [FEATURE] Pass on meta-data from expectation json to validation result json (#2881) (Thanks @sushrut9898)
- [FEATURE] Add sqlalchemy engine support for
column.most_common_value
metric (#3020) (Thanks @shpolina) - [BUGFIX] Added newline to CLI message for consistent formatting (#3127) (Thanks @ismaildawoodjee)
- [BUGFIX] fix pip install snowflake build error with Python 3.9 (#3119) (Thanks @jdimatteo)
- [BUGFIX] Populate (data) asset name in data docs for RuntimeDataConnector (#3105) (Thanks @ceshine)
- [DOCS] Correct path to docs_rtd/changelog.rst (#3120) (Thanks @jdimatteo)
- [DOCS] Fix broken links in "How to write a 'How to Guide'" (#3112)
- [DOCS] Port over "How to add comments to Expectations and display them in DataDocs" from RTD to Docusaurus (#3078)
- [DOCS] Port over "How to create a Batch of data from an in memory Spark or Pandas DF" from RTD to Docusaurus (#3099)
- [DOCS] Update CLI codeblocks in create_your_first_expectations.md (#3106) (Thanks @ories)
- [MAINTENANCE] correct typo in docstring (#3117)
- [MAINTENANCE] DOCS/GDOC-130/Add Changelog (#3121)
- [MAINTENANCE] fix docstring for expectation "expect_multicolumn_sum_to_equal" (previous version was not precise) (#3110)
- [MAINTENANCE] Fix typos in docstrings in map_metric_provider partials (#3111)
- [MAINTENANCE] Make sure that all imports use column_aggregate_metric_provider (not column_aggregate_metric). (#3128)
- [MAINTENANCE] Rename column_aggregate_metric.py into column_aggregate_metric_provider.py for better code readability. (#3123)
- [MAINTENANCE] rename ColumnMetricProvider to ColumnAggregateMetricProvider (with DeprecationWarning) (#3100)
- [MAINTENANCE] rename map_metric.py to map_metric_provider.py (with DeprecationWarning) for a better code readability/interpretability (#3103)
- [MAINTENANCE] rename table_metric.py to table_metric_provider.py with a deprecation notice (#3118)
- [MAINTENANCE] Update CODE_OF_CONDUCT.md (#3066)
- [MAINTENANCE] Upgrade to modern Python syntax (#3068) (Thanks @cclauss)
#
0.13.24- [FEATURE] Script to automate proper triggering of Docs Azure pipeline (#3003)
- [BUGFIX] Fix an undefined name that could lead to a NameError (#3063) (Thanks @cclauss)
- [BUGFIX] fix incorrect pandas top rows usage (#3091)
- [BUGFIX] Fix parens in Expectation metric validation method that always returned True assertation (#3086) (Thanks @morland96)
- [BUGFIX] Fix run_diagnostics for contrib expectations (#3096)
- [BUGFIX] Fix typos discovered by codespell (#3064) (Thanks cclauss)
- [BUGFIX] Wrap get_view_names in try clause for passing the NotImplemented error (#2976) (Thanks @kj-9)
- [DOCS] Ensuring consistent style of directories, files, and related references in docs (#3053)
- [DOCS] Fix broken link to example DAG (#3061) (Thanks fritz-astronomer)
- [DOCS] GDOC-198 cleanup TOC (#3088)
- [DOCS] Migrating pages under guides/miscellaneous (#3094) (Thanks @spbail)
- [DOCS] Port over âHow to configure a new Checkpoint using test_yaml_configâ from RTD to Docusaurus
- [DOCS] Port over âHow to configure an Expectation store in GCSâ from RTD to Docusaurus (#3071)
- [DOCS] Port over âHow to create renderers for custom Expectationsâ from RTD to Docusaurus
- [DOCS] Port over âHow to run a Checkpoint in Airflowâ from RTD to Docusaurus (#3074)
- [DOCS] Update how-to-create-and-edit-expectations-in-bulk.md (#3073)
- [MAINTENANCE] Adding a comment explaining the IDENTITY metric domain type. (#3057)
- [MAINTENANCE] Change domain key value from âcolumnâ to âcolumn_listâ in ExecutionEngine implementations (#3059)
- [MAINTENANCE] clean up metric errors (#3085)
- [MAINTENANCE] Correct the typo in the naming of the IDENTIFICATION semantic domain type name. (#3058)
- [MAINTENANCE] disable snowflake tests temporarily (#3093)
- [MAINTENANCE][DOCS] Port over âHow to host and share Data Docs on GCSâ from RTD to Docusaurus (#3070)
- [MAINTENANCE] Enable repr for MetricConfiguration to assist with troubleshooting. (#3075)
- [MAINTENANCE] Expand test of a column map metric to underscore functionality. (#3072)
- [MAINTENANCE] Expectation anonymizer supports v3 expectation registry (#3092)
- [MAINTENANCE] Fix -- check for column key existence in accessor_domain_kwargsn for condition map partials. (#3082)
- [MAINTENANCE] Missing import of SparkDFExecutionEngine was added. (#3062)
#
Older ChangelistOlder changelist can be found at https://github.com/great-expectations/great_expectations/blob/develop/docs_rtd/changelog.rst