关于人工智能:数据科学家的Pytest

作者|Khuyen Tran
编译|VK
起源|Towards Datas Science

动机

利用不同的python代码来解决notebook中的数据是很乏味的，然而为了使代码具备可复制性，你须要将它们放入函数和类中。将代码放入脚本时，代码可能会因某些函数而中断。那么，如何查看你的性能是否如你所冀望的那样工作呢？

例如，咱们应用TextBlob创立一个函数来提取文本的情感，TextBlob是一个用于解决文本数据的Python库。咱们心愿确保它像咱们预期的那样工作：如果测试为踊跃，函数返回一个大于0的值；如果文本为消极，则返回一个小于0的值。

from textblob import TextBlob

def extract_sentiment(text: str):
        '''应用textblob提取情绪。
            在范畴[- 1,1]内'''

        text = TextBlob(text)

        return text.sentiment.polarity

要晓得函数是否每次都会返回正确的值，最好的办法是将这些函数利用于不同的示例，看看它是否会产生咱们想要的后果。这就是测试的重要性。

一般来说，你应该在数据迷信我的项目中应用测试，因为它容许你：

确保代码按预期工作
检测边缘状况
有信念用改良的代码替换现有代码，而不用放心毁坏整个管道

有许多Python工具可用于测试，但最简略的工具是Pytest。

Pytest入门

Pytest是一个框架，它使得用Python编写小测试变得容易。我喜爱pytest，因为它能够帮忙我用起码的代码编写测试。如果你不相熟测试，那么pytest是一个很好的入门工具。

要装置pytest，请运行

pip install -U pytest

要测试下面所示的函数，咱们能够简略地创立一个函数，该函数以test_结尾，前面跟着咱们要测试的函数的名称，即extract_sentiment

#sentiment.py
def extract_sentiment(text: str):
        '''应用textblob提取情绪。
            在范畴[- 1,1]内'''

        text = TextBlob(text)

        return text.sentiment.polarity

def test_extract_sentiment():

    text = "I think today will be a great day"

    sentiment = extract_sentiment(text)

    assert sentiment > 0

在测试函数中，咱们将函数extract_sentiment利用于示例文本：“I think today will be a great day”。咱们应用assert sentiment > 0来确保情绪是踊跃的。

就这样！当初咱们筹备好运行测试了。

如果咱们的脚本名是sentiment.py，咱们能够运行

pytest sentiment.py

Pytest将遍历咱们的脚本并运行以test结尾的函数。下面的测试输入如下所示

========================================= test session starts ==========================================
platform linux -- Python 3.8.3, pytest-5.4.2, py-1.8.1, pluggy-0.13.1

collected 1 item
process.py .                                                                                     [100%]

========================================== 1 passed in 0.68s ===========================================

很酷！咱们不须要指定要测试哪个函数。只有函数名以test结尾，pytest就会检测并执行该函数！咱们甚至不须要导入pytest就能够运行pytest

如果测试失败，pytest会产生什么输入？

#sentiment.py

def test_extract_sentiment():

    text = "I think today will be a great day"

    sentiment = extract_sentiment(text)

    assert sentiment < 0

>>> pytest sentiment.py

========================================= test session starts ==========================================
platform linux -- Python 3.8.3, pytest-5.4.2, py-1.8.1, pluggy-0.13.1
collected 1 item

process.py F                                                                                     [100%]
=============================================== FAILURES ===============================================
________________________________________ test_extract_sentiment ________________________________________

def test_extract_sentiment():
    
        text = "I think today will be a great day"
    
        sentiment = extract_sentiment(text)
    
>       assert sentiment < 0
E       assert 0.8 < 0

process.py:17: AssertionError
======================================= short test summary info ========================================
FAILED process.py::test_extract_sentiment - assert 0.8 < 0
========================================== 1 failed in 0.84s ===========================================

从输入能够看出，测试失败是因为函数的情感值为0.8，并且不小于0！咱们不仅能够晓得咱们的函数是否如预期的那样工作，而且还能够晓得为什么它不起作用。从这个角度来看，咱们晓得在哪里修复咱们的函数，以实现咱们想要的性能。

同一函数的屡次测试

咱们能够用其余例子来测试咱们的函数。新测试函数的名称是什么？

第二个函数的名称能够是test_extract_sentiment_2，如果咱们想在带有负面情绪的文本上测试函数，那么它的名称能够是test_extract_sentiment_negative。任何函数名只有以test结尾就能够工作

#sentiment.py

def test_extract_sentiment_positive():

    text = "I think today will be a great day"

    sentiment = extract_sentiment(text)

    assert sentiment > 0

def test_extract_sentiment_negative():

    text = "I do not think this will turn out well"

    sentiment = extract_sentiment(text)

    assert sentiment < 0

>>> pytest sentiment.py

========================================= test session starts ==========================================
platform linux -- Python 3.8.3, pytest-5.4.2, py-1.8.1, pluggy-0.13.1
collected 2 items

process.py .F                                                                                    [100%]
=============================================== FAILURES ===============================================
___________________________________ test_extract_sentiment_negative ____________________________________

def test_extract_sentiment_negative():
    
        text = "I do not think this will turn out well"
    
        sentiment = extract_sentiment(text)
    
>       assert sentiment < 0
E       assert 0.0 < 0

process.py:25: AssertionError
======================================= short test summary info ========================================
FAILED process.py::test_extract_sentiment_negative - assert 0.0 < 0
===================================== 1 failed, 1 passed in 0.80s ======================================

从输入中，咱们晓得一个测试通过，一个测试失败，以及测试失败的起因。咱们心愿“I do not think this will turn out well”这句话是消极的，但后果却是0。

这有助于咱们了解，函数可能不会100%精确；因而，在应用此函数提取文本情感时，咱们应该审慎。

参数化：组合测试

以上2个测试性能用于测试同一性能。有没有方法把两个例子合并成一个测试函数？这时参数化就派上用场了

用样本列表参数化

应用pytest.mark.parametrize()，通过在参数中提供示例列表，咱们能够应用不同的示例执行测试。

# sentiment.py

from textblob import TextBlob
import pytest

def extract_sentiment(text: str):
        '''应用textblob提取情绪。
            在范畴[- 1,1]内'''

        text = TextBlob(text)

        return text.sentiment.polarity

testdata = ["I think today will be a great day","I do not think this will turn out well"]

@pytest.mark.parametrize('sample', testdata)
def test_extract_sentiment(sample):

    sentiment = extract_sentiment(sample)

    assert sentiment > 0

在下面的代码中，咱们将变量sample调配给一个示例列表，而后将该变量增加到测试函数的参数中。当初每个例子将一次测试一次。

========================== test session starts ===========================
platform linux -- Python 3.8.3, pytest-5.4.2, py-1.8.1, pluggy-0.13.1
collected 2 items

sentiment.py .F                                                    [100%]

================================ FAILURES ================================
_____ test_extract_sentiment[I do not think this will turn out well] _____

sample = 'I do not think this will turn out well'

@pytest.mark.parametrize('sample', testdata)
    def test_extract_sentiment(sample):
    
        sentiment = extract_sentiment(sample)
    
>       assert sentiment > 0
E       assert 0.0 > 0

sentiment.py:19: AssertionError
======================== short test summary info =========================
FAILED sentiment.py::test_extract_sentiment[I do not think this will turn out well]
====================== 1 failed, 1 passed in 0.80s ===================

应用parametrize()，咱们能够在once函数中测试两个不同的示例！

应用示例列表和预期输入进行参数化

如果咱们冀望不同的例子有不同的输入呢？Pytest还容许咱们向测试函数的参数增加示例和预期输入！

例如，上面的函数查看文本是否蕴含特定的单词。

def text_contain_word(word: str, text: str):
    '''查看文本是否蕴含特定的单词'''
    
    return word in text

如果文本蕴含单词，则返回True。

如果单词是“duck”，而文本是“There is a duck in this text”，咱们冀望返回True。

如果单词是‘duck’，而文本是‘There is nothing here’，咱们冀望返回False。

咱们将应用parametrize()而不应用元组列表。

# process.py
import pytest
def text_contain_word(word: str, text: str):
    '''查找文本是否蕴含特定的单词'''
    
    return word in text

testdata = [
    ('There is a duck in this text',True),
    ('There is nothing here', False)
    ]

@pytest.mark.parametrize('sample, expected_output', testdata)
def test_text_contain_word(sample, expected_output):

    word = 'duck'

    assert text_contain_word(word, sample) == expected_output

函数的参数构造为parametrize(‘sample，expected_out’，’testdata)，testdata=[(<sample1>，<output1>)，(<sample2>，<output2>)

>>> pytest process.py

========================================= test session starts ==========================================
platform linux -- Python 3.8.3, pytest-5.4.2, py-1.8.1, pluggy-0.13.1
plugins: hydra-core-1.0.0, Faker-4.1.1
collected 2 items

process.py ..                                                                                    [100%]

========================================== 2 passed in 0.04s ===========================================

咱们的两个测试都通过了！

一次测试一个函数

当脚本中测试函数的数量越来越大时，你可能心愿一次测试一个函数而不是多个函数。用pytest很容易，pytest file.py::function_name

testdata = ["I think today will be a great day","I do not think this will turn out well"]

@pytest.mark.parametrize('sample', testdata)
def test_extract_sentiment(sample):

    sentiment = extract_sentiment(sample)

    assert sentiment > 0


testdata = [
    ('There is a duck in this text',True),
    ('There is nothing here', False)
    ]

@pytest.mark.parametrize('sample, expected_output', testdata)
def test_text_contain_word(sample, expected_output):

    word = 'duck'

    assert text_contain_word(word, sample) == expected_output

例如，如果你只想运行test_text_contain_word，请运行

pytest process.py::test_text_contain_word

而pytest只执行咱们指定的一个测试！

fixture：应用雷同的数据来测试不同的函数

如果咱们想用雷同的数据来测试不同的函数呢？例如，咱们想测试“今Today I found a duck and I am happy”这句话是否蕴含“duck ”这个词，它的情绪是否是踊跃的。这是fixture派上用场的时候。

pytest fixture是一种向不同的测试函数提供数据的办法

@pytest.fixture
def example_data():
    return 'Today I found a duck and I am happy'


def test_extract_sentiment(example_data):

    sentiment = extract_sentiment(example_data)

    assert sentiment > 0

def test_text_contain_word(example_data):

    word = 'duck'

    assert text_contain_word(word, example_data) == True

在下面的示例中，咱们应用decorator创立了一个示例数据@pytest.fixture在函数example_data的上方。这将把example_data转换成一个值为“Today I found a duck and I am happy”的变量

当初，咱们能够应用示例数据作为任何测试的参数！

组织你的我的项目

最初但并非最不重要的是，当代码变大时，咱们可能须要将数据迷信函数和测试函数放在两个不同的文件夹中。这将使咱们更容易找到每个函数的地位。

用test_<name>.py或<name>_test.py命名咱们的测试函数. Pytest将搜寻名称以“test”结尾或以“test”结尾的文件，并在该文件中执行名称以“test”结尾的函数。这很不便！

有不同的办法来组织你的文件。你能够将咱们的数据迷信文件和测试文件组织在同一个目录中，也能够在两个不同的目录中组织，一个用于源代码，一个用于测试

办法1：

test_structure_example/
├── process.py
└── test_process.py

办法2：

test_structure_example/
├── src
│   └── process.py
└── tests
    └── test_process.py

因为数据迷信函数很可能有多个文件，测试函数有多个文件，所以你可能须要将它们放在不同的目录中，如办法2。

这是2个文件的样子

from textblob import TextBlob

def extract_sentiment(text: str):
        '''应用textblob提取情绪。
            在范畴[- 1,1]内'''

        text = TextBlob(text)

        return text.sentiment.polarity

import sys
import os.path
sys.path.append(
    os.path.abspath(os.path.join(os.path.dirname(__file__), os.path.pardir)))
from src.process import extract_sentiment
import pytest


def test_extract_sentiment():

    text = 'Today I found a duck and I am happy'

    sentiment = extract_sentiment(text)

    assert sentiment > 0

简略地说增加sys.path.append(os.path.abspath(os.path.join(os.path.dirname(__file__), os.path.pardir)))能够从父目录导入函数。

在根目录(test_structure_example/)下，运行pytest tests/test_process.py或者运行在test_structure_example/tests目录下的pytest test_process.py。

========================== test session starts ===========================
platform linux -- Python 3.8.3, pytest-5.4.2, py-1.8.1, pluggy-0.13.1
collected 1 item

tests/test_process.py .                                            [100%]

=========================== 1 passed in 0.69s ============================

很酷！

论断

你刚刚理解了pytest。我心愿本文能很好地概述为什么测试很重要，以及如何将测试与pytest联合到数据迷信我的项目中。通过测试，你不仅能够晓得你的函数是否按预期工作，而且还能够自信地应用不同的工具或不同的代码构造来切换现有代码。

本文的源代码能够在这里找到：

https://github.com/khuyentran…

我喜爱写一些根本的数据迷信概念，玩不同的算法和数据迷信工具。

原文链接：https://towardsdatascience.co…

欢送关注磐创AI博客站：
http://panchuang.net/

sklearn机器学习中文官网文档：
http://sklearn123.com/

欢送关注磐创博客资源汇总站：
http://docs.panchuang.net/

关于人工智能:数据科学家的Pytest

动机

Pytest入门

同一函数的屡次测试

参数化：组合测试

用样本列表参数化

应用示例列表和预期输入进行参数化

一次测试一个函数

fixture：应用雷同的数据来测试不同的函数

组织你的我的项目

论断

评论

发表回复取消回复

更多文章

DDN HPC 存储硬件架构设计深度分析

探秘IO500：从Lustre并行文件系统出发，开启HPC存储性能新征程

苹果iOS打包的ipa应用无法安装？一篇文章带你了解可能的原因及排查方法

图解Golang：从零开始实现简易版过期LRU缓存

关于人工智能:数据科学家的Pytest

动机

Pytest入门

同一函数的屡次测试

参数化：组合测试

用样本列表参数化

应用示例列表和预期输入进行参数化

一次测试一个函数

fixture：应用雷同的数据来测试不同的函数

组织你的我的项目

论断

评论

发表回复 取消回复

更多文章

DDN HPC 存储硬件架构设计深度分析

探秘IO500：从Lustre并行文件系统出发，开启HPC存储性能新征程

苹果iOS打包的ipa应用无法安装？一篇文章带你了解可能的原因及排查方法

图解Golang：从零开始实现简易版过期LRU缓存

发表回复取消回复