Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Syncing #1

Merged
merged 75 commits into from
Oct 6, 2018
Merged

Syncing #1

merged 75 commits into from
Oct 6, 2018

Conversation

cTxplorer
Copy link
Owner

@cTxplorer cTxplorer commented Oct 6, 2018

We are excited to review your PR.

So we can do the best job, please check:

  • There's a descriptive title that will make sense to other developers some time from now.
  • There's associated issues. All PR's should have issue(s) associated - unless a trivial self-evident change such as fixing a typo. You can use the format Fixes #nnnn in your description to cause GitHub to automatically close the issue(s) when your PR is merged.
  • Your change description explains what the change does, why you chose your approach, and anything else that reviewers should know.
  • You have included any necessary tests in the same PR.

sfilipi and others added 30 commits September 21, 2018 22:01
Fixes #982
* different Config files for train and test
* solves problem of long running time
* train benchmarks contain only one iteration as it gives more idea on how the users will use. (with no warmup iteration)
* predict config is the original version
* Add analayzer to nuget
* Generalize target file to enable easier analayzer inclusions in future
* Transform wrappers and a reference implementation for tokenizers

* Added pigsty extensions

* Added pigsty test

* Fixed most important PR comments

* PR comments

* Converted all text transforms into transformers/estimators.

* Addressed reviewers's comments.

* Addressed reviewers' comments.

* Converted LdaTransform into Transformer/Estimator.

* Fixed LdaNative issue and addressed reviewers' comments.

* Fixed issue with test.

* Diabled end-to-end LdaTransform test due to incosistency of outputs.
* Microsoft.ML.Data.StaticPipe to Microsoft.ML.StaticPipe.
* Columns now in StaticPipe as opposed to StaticPipe.Runtime.
* Stop loading assemblies in ComponentCatalog.

Write the AssemblyName into the model, and use it to register the assembly during model load.

* Move ComponentCatalog from a static class to a member of IHostEnvironment.

* Update tests for ComponentCatalog refactoring.

* minor cleanup

* Add AssemblyName to all model VersionInfo instances.

Also fix a couple more tests.

* Load and register all assemblies in the Maml directory.
Ensure all loaded assemblies are registered in Experiment to maintain compability.
Fix tests to not use ComponentCatalog but direct instantiation instead.

* Sync up with latest code.

* Fix newly added test

* Clean up some test changes.

* Fix up for latest code

* Add path filtering back to LoadAssembliesInDir

* Update TestAutoInference to use the correct Environment.

* Respond to PR feedback.

* Make all AutoInference tests use LocalEnvironment.
…ot frozen (#853)

* building transform from ground up

* dummy transform works after fixing the getters

* SavedModel format works for Train, but fails for Save&Predict

* remove dummy transform

* remove dummy unit test

* Works with non-frozen models

* building transform from ground up

* dummy transform works after fixing the getters

* SavedModel format works for Train, but fails for Save&Predict

* remove dummy transform

* remove dummy unit test

* fix compilation issues; verify existing tests work fine

* works locally; need to refactor code

* refactored code; keeping only 1 version of the convenience API

* added class for directory structure

* using latest nuget package (0.0.3) for Microsoft.ML.TensorFlow.TestModels

* delete temporary files used when loading/saving models

* delete local models; the updated nuget version (0.0.3) for Microsoft.ML.TensorFlow.TestModels contains these models

* modified logic for load/restore of models

* modified logic for load&restore of unfrozen models

* model version update to support non-frozen models

* based on the code review comments, we now infer if the provided model is frozen or not

* simplify the logic in Save() related to loading of SavedModel.

* trying Eric's suggestion

* revert back to previous changes

* attempt to use stream copy approach instead of in-memory

* deleting some commented out code

* Ensure we only copy the file segment & cleanup path logic

* added finalizer that closes the session (if it isn't closed) and deletes the temporary directory

* cleanup + misc review comments

* trying to create temp dir with proper ACLs for high priviledge users

* create temp dir with proper ACLs for high-privilege processes

* fix build after merge with latest master

* taking care of review comments related to model versioning of TFTransform

* remove IDisposable from the TensorFlowTransform; renaming some methods

* refactor code so we have only 1 constructor for the TensorFlowTransform (as suggested in review comment)

* fix issues with nuget packaging; refactored the code + added comments

* add checks in code to make sure that the input is not a variable length vector

* fix typo in name of package

* (1) added SavedModel test for MNIST model (2) added try/finally for deleting temp folder  (3) deleted test using Legacy Learning API

* remove and sort usings in file TrainSaveModelAndPredict.cs

* using spaces in nupkgproj

* error checking for passed in IHostEnvironment

* fix TargetFramework version (netcore 2.0) of DnnAnalyzer to match that of Microsoft.ML.TensorFlow
…o estimators (#957)

* derived from trainerestimatorbase

* cleaned up

* sorting namespaces

* fixed review comments, still some more features to add

* pr comments on tests

* updated the tests

* fixed review comments

* fixing review comments

* fixed bugs

* refactored code

* fixed review comments and cleaned up

* cleaned up code and fixed documentation text

* fixed review comments

* fixed review comments

* fixed review comments

* fized review comment

* fixed more review comments
adding PcaTrainer as estimator and tidying the PredictionTransformer class.
…on (#1009)

* FastTree classification and regression xtensions
Adds a benchmark test to measure performance of doing many single predictions with PredictionEngine.

Closes #1013
* Remove the error tracing when assembly loading fails for Maml.

Also adding our native assemblies to the list to skip, so they aren't attempted to be loaded.

Fix #1034
Helps the user to relate to the macOS version faster.
* Multiclass logistic Regression tests enabled

* threshold providing in tests

* defining tolerance as a constant in baseTestBaseline Class

* upper case camel for constant and _ for large decimal numbers
GalOshri and others added 29 commits October 1, 2018 11:04
* Add a workaround for the tests hanging while loading MKL.

The workaround is to ensure the MKL library is loaded very early in the test process, so it doesn't cause the deadlock.

Workaround #1073

Another deadlock also occurs when running TestAutoInference and TestPipelineSweeper in parallel. Marking these tests to not run in parallel anymore.

Workaround #1095

Moving back to the Azure Hosted VS2017 pool to run the tests now that we've narrowed the deadlocks down.
…1032)

* Add instructions for building for .NET Core 3.0, and make them work.

Fix #1011

* Add config specific properties for the Intrinsics configs.

* Allow tests to be run against .NET Core 3.0
* Port of time series.
)

* Static pipelines now handle types with PipelineColumn properties.

* Update the internal infrastructure to accomodate these types,
* Update the Roslyn analyzer to accomodate these types.
* Update the tests so that they exercise this capability.
* Opportunistically fix some problems with the Roslyn analyzer
  brought up in this work.
* turned string separators into char array separators

* fixed review comments

* allowed the old api to still work through the arguments object

* added command line test

* fixed test, and added visibility field to arguments

* fixing review comments
…tructors (#1135)

* Remove ComponentCatalog from EntryPointGraph's and GraphRunner's constructors

* Remove catalog temp variable, use directly in call to ValidateNodes

* Remove catalog temp variable from GraphRunner constructor
* add .NET Core 3.0 support for the benchmarks

* code review fixes: keep it simple
* Adding the Samples.StaticPipe project.

* Adding a sample for SDCA Regression
* Remove explicit ComponentCatalog

ValidateNodes and EntryPointNode now use the ComponentCatalog
property of IHostEnvironment.
) (#1141)

* Updating the CopyColumnsEstimator and Transform to use common code
  (#706)
This builds on the Estimator conversion for the CopyColumnsTransform.
This change is mainly refactoring as common code has moved to base level classes.

This change is the following:
       - CopyColumnTransform now derives from OneToOneTransformerBase
       - CopyColumnEstimator now derives from TrivialEstimator
       - CopyColumnTransform::Mapper now derives from MapperBase
       - Removed code that was no longer needed due to these changes

*  - Moved CopyColumnsTransform into Microsoft.ML.Transforms namespace,
 updated namespace usage and entrypoints due to this change.
 - Save now uses the SaveColumns from the base class
 - Other various changes based upon feedback.
* TrainUtils.Train does not have consistent API usage for the calibrator argument (#1023)

Updates the API signature for TrainUtils.Train to take in an IComponentFactory<ICalibratorTrainer>.

Fixes #1023
* conversion of multiclass naive bayes classifier to estimator

* added pigstension and related test

* added public methods to access label and feature histograms in the predictor

* fixed review comments on new access functions

* moved test to main file
…#1145)

* Fix MatchNumberWithTolerance to better compare floating-point values

* Updating CheckEqualityFromPathsCore to allow a tolerance match on Windows
@cTxplorer cTxplorer merged commit 511503b into cTxplorer:master Oct 6, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.