helpers

setTFConfig

This method configures the TensorFlow library settings along with configuring GPU settings.

printSampleCounts

This method prints the respective samples in the given dataframes.

def printSampleCounts(plotClassifications, training_dataframe, validation_dataframe, test_dataframe=None):
    if test_dataframe != None:
        print("Class | Train | Valid | Test")
    else:
        print("Class | Train | Valid")

    for Class in plotClassifications:
        className = Class["Classification"]
        trainCount = training_dataframe.loc[training_dataframe.label == className].shape[0]
        validCount = validation_dataframe.loc[validation_dataframe.label == className].shape[0]
        row=className+"  |  "+str(trainCount)+" ("+str(float(trainCount)/float(training_dataframe.shape[0]))+")  |  "+str(validCount)+" ("+str(float(validCount)/validation_dataframe.shape[0])+")"
        if test_dataframe != None:
            testCount = test_dataframe.loc[test_dataframe.label == className].shape[0]
            row+" | "+str(testCount)+" ("+str(float(testCount)/test_dataframe.shape[0])+")"
        print(row)
        # logging.info(row)
        if validCount == 0:
            print("Missing example for "+str(className)+" exiting")
            # logging.error("Missing example for "+str(className)+" exiting")

Parameters

plotClassifications: A list representing the classifications of the plots
training_dataframe: A Pandas DataFrame of the training data
validation_dataframe: A Pandas DataFrame of the validation data
test_dataframe: An optional dataframe of the test data

getGenerator

This method returns data generators for the validation, training, and testing datasets. The pixels are normalized and the generator is configured to respective settings. If a testing dataframe is unavailable, the validation dataframe is used to predict a possible dataframe.

# Extended code available on Github
def getGenerator(training_dataframe, validation_dataframe, test_dataframe=None, BS=32):

Parameters

training_dataframe: A Pandas DataFrame of the training data
validation_dataframe: A Pandas DataFrame of the validation data
test_dataframe: An optional dataframe of the test data
BS: The generators’ batch size. Defaults to 32.