src/sagemaker_sklearn_extension/preprocessing/encoders.py (1 line): - line 1043: # TODO: In the paper this function is based on the ngram number was fixed as 3. As a follow up, consider