https://github.com/cair/tm-xor-proof

#tsetlin-machine #machine-learning #game-theory #propositional-logic #pattern-recognition #bandit-learning #frequent-pattern-mining #learning-automata

Last synced: 10 months ago · JSON representation

Repository

#tsetlin-machine #machine-learning #game-theory #propositional-logic #pattern-recognition #bandit-learning #frequent-pattern-mining #learning-automata

Basic Info

Host: GitHub
Owner: cair
Language: Cython
Default Branch: main
Homepage:
Size: 68.4 KB

Statistics

Stars: 5
Watchers: 5
Forks: 1
Open Issues: 0
Releases: 0

Created over 5 years ago · Last pushed almost 4 years ago

Metadata Files

Readme

Convergence-of-Tsetlin-Machine-for-Fundamental-Digital-Operations-the-XOR-Case

The Tsetlin Machine (TM) is a novel machine learning algorithm with several distinct properties, including transparent inference/learning and hardware-near building blocks. Although numerous papers explore the TM empirically, many of its properties have not yet been analyzed mathematically. In this article, we analyze the convergence of the TM when input is non-linearly related to output by the XOR-operator. Our analysis reveals that the TM, with just two conjunctive clauses, can converge surely to reproducing XOR, learning from training data over an infinite time horizon. Furthermore, the analysis shows how the hyper-parameter T guides clause construction so that the clauses capture the distinct sub-patterns in the data. Our analysis of convergence for XOR thus lays the foundation for analyzing other more complex logical expressions. These analyses altogether, from a mathematical perspective, provide new insights on why TMs have obtained state-of-the-art performance on several pattern recognition problems.

(https://arxiv.org/abs/2101.02547)

Code: XORDemo.py

Here we have the basic experiment of the proof where we utilize a Tsetlin Machine having merely two clauses to learn the sub-patterns of only class 1. Towards the end of the code, the state of the Tsetlin automata at end of training in both the clauses are printed to reveal the pattern the each clause has learned. As prented below in the output, the clause 1 has learned the pattern (0 1) while the clause 2 learns pattern (1 0).

```ruby

!/usr/bin/python

import cython

import numpy as np import pyximport; pyximport.install(setupargs={ "includedirs":np.getinclude()}, reloadsupport=True)

import XOR

X = np.random.randint(2, size=(10000,2), dtype=np.int32) Y = np.ones([10000]).astype(dtype=np.int32)

for i in range(10000): if X[i,0] == X[i,1]: Y[i] = 0

Parameters for the Tsetlin Machine

T = 1 s = 3.9 numberofclauses = 2 states = 100 Th = 1

Parameters of the pattern recognition problem

numberoffeatures = 2

Training configuration

epochs = 200

Loading of training and test data

NoOfTrainingSamples = len(X)*80//100 NoOfTestingSamples = len(X) - NoOfTrainingSamples

Xtraining = X[0:NoOfTrainingSamples,:] # Input features ytraining = Y[0:NoOfTrainingSamples] # Target value

Xtest = X[NoOfTrainingSamples:NoOfTrainingSamples+NoOfTestingSamples,:] # Input features ytest = Y[NoOfTrainingSamples:NoOfTrainingSamples+NoOfTestingSamples] # Target value

This is a multiclass variant of the Tsetlin Machine, capable of distinguishing between multiple classes

tsetlinmachine = XOR.TsetlinMachine(numberofclauses, numberof_features, states, s, T, Th)

Training of the Tsetlin Machine in batch mode. The Tsetlin Machine can also be trained online

tsetlinmachine.fit(Xtraining, ytraining, ytraining.shape[0], epochs=epochs)

Some performacne statistics

print ("Accuracy on test data (no noise):", tsetlinmachine.evaluate(Xtest, ytest, ytest.shape[0]))

for clause in range(numberofclauses): print('Clause', clause+1), for feature in range(numberoffeatures): for tatype in range(2): State = tsetlinmachine.getstate(clause,feature,tatype) if State >= 101: Decision = 'In' else: Decision = 'Ex' print('feature %d TA %d State %d' % (feature, tatype+1, State)), print('/n') ```

Output

``` Clause 1 feature 0 TA 1 State 99 feature 0 TA 2 State 101 feature 1 TA 1 State 101 feature 1 TA 2 State 12

Clause 2 feature 0 TA 1 State 101 feature 0 TA 2 State 70 feature 1 TA 1 State 100 feature 1 TA 2 State 101 ```

Code: XORDemo_print.py

In this experiment, we are interested in seeing the distribution of the sub-potterns among clauses when different thresholds are used (T=2 and T=3: a detail explanation can be found in the section four of the article). For this, the state of the Tsetlin Automata in each clause at each training epoch is saved into an excel file. After a socondary analysis on excel, plots in section four in the artical are created.

```ruby

!/usr/bin/python

import numpy as np import pyximport; pyximport.install(setupargs={ "includedirs":np.getinclude()}, reloadsupport=True)

import XOR_print

samples = 50 X = np.random.random_integers(0,1,size=(samples,2)).astype(dtype=np.int32) Y = np.ones([samples]).astype(dtype=np.int32)

for i in range(samples): if X[i,0] == X[i,1]: Y[i] = 0

Parameters for the Tsetlin Machine

T = 2 s = 3.9 numberofclauses = 5 states = 100 Th = 1

Parameters of the pattern recognition problem

numberoffeatures = 2

Training configuration

epochs = 200

Loading of training and test data

NoOfTrainingSamples = len(X)*80//100 NoOfTestingSamples = len(X) - NoOfTrainingSamples

Xtraining = X[0:NoOfTrainingSamples,:] # Input features ytraining = Y[0:NoOfTrainingSamples] # Target value

Xtest = X[NoOfTrainingSamples:NoOfTrainingSamples+NoOfTestingSamples,:] # Input features ytest = Y[NoOfTrainingSamples:NoOfTrainingSamples+NoOfTestingSamples] # Target value

This is a multiclass variant of the Tsetlin Machine, capable of distinguishing between multiple classes

tsetlinmachine = XORprint.TsetlinMachine(numberofclauses, numberoffeatures, states, s, T, Th)

Training of the Tsetlin Machine in batch mode. The Tsetlin Machine can also be trained online

tsetlinmachine.fit(Xtraining, ytraining, ytraining.shape[0], epochs=epochs)

Some performacne statistics

print ("Accuracy on test data (no noise):", tsetlinmachine.evaluate(Xtest, ytest, ytest.shape[0])) ```

Output

Accuracy on test data (no noise): 1.0

Code: proofofthe_convergence.py

This is the python implementation of Algorithm 1 in Apendix 1 of the ariticle. Mainly, the code generates the Transition Probability Matrix, computes the transpose of it and returns the result of power infinity.

```ruby import numpy as np

s = 10

DiaSum = np.zeros([256,1]) M = np.zeros([256,256]) Mx = np.zeros([254,1,8]) AbsorbingStates = np.zeros([2,1,8])

My = np.zeros([254,1,8])

Input = np.zeros([4,2]) Input[1,1],Input[2,0],Input[3,0],Input[3,1] = 1,1,1,1

k = 0 for i in range(256): if i == 105 or i == 150: continue k += 1 binary = "{0:08b}".format(i) x = 0 #Mx[k-1,1,0] = i for j in binary: Mx[k-1,0,x] = int(j) x += 1

y = 0 for i in (105,150): binary = "{0:08b}".format(i) x = 0 #Mx[k-1,1,0] = i for j in binary: AbsorbingStates[y,0,x] = int(j) x += 1 y += 1

Mx = np.vstack((AbsorbingStates,Mx)) My = Mx

def CalculateClasueOutputs(TADecisions, Input):
TADecisions = np.reshape(TADecisions, (2, 4), order='F')

Clause1 = 1
if (TADecisions[0,0] == 1 and Input[0] == 0) or (TADecisions[1,0] == 1 and Input[0] == 1) or (TADecisions[0,1] == 1 and Input[1] == 0) or (TADecisions[1,1] == 1 and Input[1] == 1):
    Clause1 = 0

Clause2 = 1
if (TADecisions[0,2] == 1 and Input[0] == 0) or (TADecisions[1,2] == 1 and Input[0] == 1) or (TADecisions[0,3] == 1 and Input[1] == 0) or (TADecisions[1,3] == 1 and Input[1] == 1):
    Clause2 = 0

return Clause1, Clause2

def ActivationProbability(FeedbackType, v): if FeedbackType == 2: if v == 0: ActProbability = 0.5 else: ActProbability = 1 else: if v == 0: ActProbability = 0.5 else: ActProbability = 0.0

return ActProbability

def getLiteral(index, Input): if index % 2 == 0: literalType = 1 else: literalType = 0

if index % 4 < 2:
    INx = 0
else:
    INx = 1

if literalType == 1:
    literal = Input[INx]
else:
    literal = int(not Input[INx])

return literal

def FeedbackProbability(FeedbackType, ClauseOutput, Literal, taDecision, change): #print('FeedbackType', FeedbackType, 'ClauseOutput', ClauseOutput, 'Literal', Literal, 'taDecision', taDecision, 'change', change) FeedProb = 0.0 Feedrew = 0.0 Feedinact = 0.0 if change == 1: if FeedbackType == 1: if ClauseOutput == 1: if Literal == 1: if taDecision == 0: FeedProb = (s-1)/s

        else:
            if Literal == 1:
                if taDecision == 1:
                    FeedProb = 1/s

            else:
                if taDecision == 1:
                    FeedProb = 1/s

    else:
        if ClauseOutput == 1:
            if Literal == 0:
                if taDecision == 0:
                    FeedProb = 1
else:
    if FeedbackType == 1:
        if ClauseOutput == 1:
            if Literal == 1:
                if taDecision == 1:
                    Feedrew = (s-1)/s
                    Feedinact = 1/s
                else: 
                    Feedinact = 1/s
            else:
                if taDecision == 0:
                    Feedrew = 1/s
                    Feedinact = (s-1)/s
        else:
            if taDecision == 1:
                Feedinact = (s-1)/s
            else:
                Feedrew = 1/s
                Feedinact = (s-1)/s
    else:
        if ClauseOutput == 1:
            if Literal == 1:
                Feedinact = 1.0
        else:
            Feedinact = 1.0



if change == 1:                
    FeedProb = FeedProb
else:
    FeedProb = Feedrew + Feedinact
    #print(FeedProb)  
return FeedProb

for xaxis in range(len(Mx)): for yaxis in range(len(My)): p = 0 for case in range(4): # different Inputs if case == 0 or case == 3: FeedbackType = 2 else: FeedbackType = 1

        Clause1, Clause2 = CalculateClasueOutputs(Mx[xaxis,:,:],Input[case,:])
        Clause1Prime, Clause2Prime = CalculateClasueOutputs(My[yaxis,:,:],Input[case,:])
        ActProb = ActivationProbability(FeedbackType, Clause1+Clause2)

        if np.array_equal(Mx[xaxis,:,0:4], My[yaxis,:,0:4]):#clause1, no change case
            FeedProb = 1
            for index in range(4):
                literal = getLiteral(index, Input[case,:])
                change = 0
                FeedProb = FeedProb * FeedbackProbability(FeedbackType, Clause1, literal, Mx[xaxis,:,index], change)

            Pc1 = (ActProb * FeedProb) + (1 - ActProb)

        else:# clause1, change case
            FeedProb = 1
            for index in range(4):
                literal = getLiteral(index, Input[case,:])
                if Mx[xaxis,:,index] == My[yaxis,:,index]:
                    change = 0
                else: 
                    change = 1

                FeedProb = FeedProb * FeedbackProbability(FeedbackType, Clause1, literal, Mx[xaxis,:,index], change)

            Pc1 = ActProb * FeedProb

        if np.array_equal(Mx[xaxis,:,4:8], My[yaxis,:,4:8]):#clause2, no change case
            FeedProb = 1
            for kk in range(4):
                index = kk+4
                literal = getLiteral(index, Input[case,:])
                change = 0
                FeedProb = FeedProb * FeedbackProbability(FeedbackType, Clause2, literal, Mx[xaxis,:,index], change)

            Pc2 = (ActProb * FeedProb) + (1 - ActProb)

        else:#clause2, change case
            FeedProb = 1
            for kk in range(4):
                index = kk+4
                literal = getLiteral(index, Input[case,:])
                if Mx[xaxis,:,index] == My[yaxis,:,index]:
                    change = 0
                else: 
                    change = 1

                FeedProb = FeedProb * FeedbackProbability(FeedbackType, Clause2, literal, Mx[xaxis,:,index], change)

            Pc2 = ActProb * FeedProb


        p += 0.25 * Pc1 * Pc2
        #print(Pc1, Pc2)

    M[yaxis, xaxis] = p

MM = M.transpose()

for j in range(256): DiaSum[j,0] = sum(MM[j,:]) # sum of all row

np.savetxt('transitionmatrixstandard.csv', MM)

mm=np.linalg.matrix_power(MM, 1000000) ```

Owner

Name: Centre for Artificial Intelligence Research (CAIR)
Login: cair
Kind: organization
Email: cair-internal@uia.no
Location: Grimstad, Norway

Website: https://cair.uia.no/
Repositories: 68
Profile: https://github.com/cair

CAIR is a centre for research excellence on artificial intelligence at the University of Agder. We attack unsolved problems, seeking superintelligence.

GitHub Events

Total

Last Year

Issues and Pull Requests

Last synced: about 1 year ago

All Time

Total issues: 0
Total pull requests: 1
Average time to close issues: N/A
Average time to close pull requests: 2 minutes
Total issue authors: 0
Total pull request authors: 1
Average comments per issue: 0
Average comments per pull request: 0.0
Merged pull requests: 1
Bot issues: 0
Bot pull requests: 0

Past Year

Issues: 0
Pull requests: 0
Average time to close issues: N/A
Average time to close pull requests: N/A
Issue authors: 0
Pull request authors: 0
Average comments per issue: 0
Average comments per pull request: 0
Merged pull requests: 0
Bot issues: 0
Bot pull requests: 0

https://github.com/cair/tm-xor-proof

Science Score: 10.0%

Repository

Basic Info

Statistics

Metadata Files

README.md

Convergence-of-Tsetlin-Machine-for-Fundamental-Digital-Operations-the-XOR-Case

Code: XORDemo.py

!/usr/bin/python

import cython

Parameters for the Tsetlin Machine

Parameters of the pattern recognition problem

Training configuration

Loading of training and test data

This is a multiclass variant of the Tsetlin Machine, capable of distinguishing between multiple classes

Training of the Tsetlin Machine in batch mode. The Tsetlin Machine can also be trained online

Some performacne statistics

Output

Code: XORDemo_print.py

!/usr/bin/python

Parameters for the Tsetlin Machine

Parameters of the pattern recognition problem

Training configuration

Loading of training and test data

This is a multiclass variant of the Tsetlin Machine, capable of distinguishing between multiple classes

Training of the Tsetlin Machine in batch mode. The Tsetlin Machine can also be trained online

Some performacne statistics

Output

Code: proofofthe_convergence.py

My = np.zeros([254,1,8])

Owner

GitHub Events

Total

Last Year

Issues and Pull Requests

All Time

Past Year

Top Authors

Issue Authors

Pull Request Authors

Top Labels

Issue Labels

Pull Request Labels