SupportGenieEx1

This program was made as an assignment to submit for an internship as a test. The code will be explained as detailed as possible. Any issues in the code or its explination are welcome in the Issues section.

Disclaimer

The program works on the basis of generated data. In order to achieve this, a random data generator is made such that the data generated is close to what is expected from a good call service provider. Due to this data generation takes time and I would highly recommend that in order to not use parameters that are too large as it would take alot of time to generate it's data.

The accuracy of the program is always questionable. I guess we'll never know until real world data is used and verified.

You can skip the explination and directly go to the user manual if you want but I would recommend reading this once.

Introduction

This program is used to predict the time a caller would have to wait before the request would be answered. It follows a simple set of steps in order to do so.

Note: It would help to have the code open in another window

If you are looking for a machine learning solution to this problem, you may stop reading this if you feel my justification for not using it is not convincing. If you aren't you can just skip ahead.

Why not machine learning?

I have not used machine learning partly as I don't know any (Do mind its not that I don't want to learn) but mostly because of the following reasons:

The mammoth ammout of data required and the 1000 new parameters I will have to create to train a model that would give me probably no proper conclusions due to the highly random nature of human beings.
I am using pseduo random numbers so the model would just find the pattern that generates these random numbers which would be a waste of effort.
The number ranges of random numbers generated are based on statistics of good waiting times thus apart from correction of the phases that take too long out of the blue this should be able to give resonably good predictions
For good predictiont the model would have to be very complicated.
This isn't enough time for me to master machine learning and I dont want to just slap in some statistical model without knowing what it is, and all that I think I know about machine learning is wrong and this would have been much simpler had I used it.

Maybe for another time this could be possible provided the resources from real world to work on it.

Now if you aren't convinced you may leave.

The Classes

The classes are fairly straight forward.

Agent

Attributes:

_agents is used to store the refernce to the objects created of Agent class.

agent_no Stores what is equivalent to and id number given to an agent

issue_assigned Stores None, when no issue is assigned to it and stores refernce to and object of type Issue if an issue is assigned to an agent. Agent.

Functions:

get_intances(cls) is a class method that is used to retive _agents

Issue

Attributes:

_issues is used to store the refernce to the objects created of Issue class.

arrival_time Time at which the issue arrives.

result Stores 'r' or 'a' indicating responded or abandoned. If issue is responded to response_time and completion_time are set. If issue was abandoned abandoned_time is set.

response_time Time at which the issue was request was accepted by an agent.

completion_time Time at which the issue was closed by agent.

abandoned_time Time at which Issue was abandoned.

Functions:

avg_response_time(cls) Class method used to generate the average time taken to respond by all the objects created.

avg_abandonment_time(cls) Class method used to generate the average time before which a user would leave the request as it was not responded to.

clear_objs(cls) Class method that is used to delete all instances of class.

push_to_json(cls, file) Class method used to move all current class methods onto a json file. Used to debug the program and also see the data generated. Note: The json dumped into file is not getting formated for some reason. Uncomment the print statement to print formatted json on the console or if you know the mistake let me know in the Issues section.

The Code

** Click on the heading to take yourself to the file containting the code being explained.

Generating Data

Most of the Data is generated using Data_generators.stats_generator.generate_data(). It works as a control unit calling other functions of Data_generators. The data generated is for a maximum of 7hrs or in the case of determining waiting time the number of requests.

`generate_data()`

This function is used for 2 operations, to generate the data upon which perdictions are made and to generate the data before the request whoose waiting time is to be predicted. The variables h = 9, m = 0, s = 5 are indicating the service starts at 09:00:05. This would not be required really in the real world its just to generate the data. queue is used to hold all the objects that have arrived but not been assigned to an Agent.

for i in range(no_of_agents):
      agent = create_agent(i)
      h, m, s = make_time(h, m, s, no_of_agents)
      arrival_time = str(f'{h}:{m}:{s}')
      req_no += 1
      generated_issue = Issue.Issue(arrival_t=arrival_time)
      agent.issue_assigned = generated_issue
      fill_up_object(agent)

The agents are created using create_agents() one at a time and the issues are generated and assigned to the created agents. make_time()(read_more_here) is used to generate time in incrementals specific to the workload meant for given number of agents. The data is filled by fill_up_object() (more_here).

Making sure that issues generated are below working hours, the issues are created and appended to the queue, the agents are checked for accomodating issues in the queue using agent_queue_handler() and the cycle continues till the last element is generated.

if no_of_req is not None and no_of_req == req_no:
          break

The above snipped is used when a finite number of data must be generated .ie. no_of_req is passed to function call. This is done when the function is used in prediction phase.

The average response time and the average abandonment time is then calculated for the dataset using the respective class functions.

The rest of the function belongs to predictive side of this function and shall be explained below.

Calculate the response and abandonment times

The calculations are performed by the 2 Issue class methods avg_response_time() and avg_abandonment_time()

`avg_response_time()`

When the function is called much like most averge functions, the total_reslution_time is calculated as sum of differences between response time and arrival time. It is then divided by the total no of objects that were taken into consideration .ie. objects with 'r' as result and then returned.

`avg_abandonment_time()`

Similar to calulating the average response time, objects that are abandoned are selected using result property ('a') and then summed together and divided by the number of abandonments to give the average abandonment time.

Create data based on number of requests

This part involves calling the generate_data() function with the parameters no_of_req,avg_res_time, and avg_aban_time that were generated from the functions previous execution. The data generation part is the same as explained already.

if no_of_req is not None and no_of_req == req_no:
            break

Here the above snippet comes into action limiting the number of requests generated to no_of_req

Correcting the calculations and predition

This part involves making corrections by the new trends and predicting the value.

if no_of_req == req_no:
        ele_in_queue = len(queue)-1
        avg_response_time = (avg_response_time + avg_res_time) // 2
        multiplier = ele_in_queue // no_of_agents + 1
        waiting_time = avg_response_time*multiplier
    else:
        waiting_time = 0

Here first the new value of response time (avg_response_time) and the recieved value (avg_res_time) are averaged correcting for the new trend in data. The elements in the queue determine which cycle does a person belong. ex: There are 18 elements in the queue and 10 agents then, multiplier = 18//10 + 1 = 1+1 = 2 This means that the person is in the second cycle of the queue and the waiting time would be, avg_response_time * 2. So if the average response time is 30seconds, the prediction for this person would be a wait time of 60s or 1min.

Name		Name	Last commit message	Last commit date
Latest commit History 40 Commits
Classes		Classes
Data		Data
Data_generators		Data_generators
.gitignore		.gitignore
README.md		README.md
USER_MANUAL.md		USER_MANUAL.md
main.py		main.py
tester.py		tester.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SupportGenieEx1

Disclaimer

Introduction

Why not machine learning?

The Classes

Agent

Attributes:

Functions:

Issue

Attributes:

Functions:

The Code

Generating Data

`generate_data()`

Calculate the response and abandonment times

`avg_response_time()`

`avg_abandonment_time()`

Create data based on number of requests

Correcting the calculations and predition

Read the USER_MANUAL to understand how to use the program.

About

Releases

Packages

Languages

Treshank/SupportGenieEx1

Folders and files

Latest commit

History

Repository files navigation

SupportGenieEx1

Disclaimer

Introduction

Why not machine learning?

The Classes

Agent

Attributes:

Functions:

Issue

Attributes:

Functions:

The Code

Generating Data

generate_data()

Calculate the response and abandonment times

avg_response_time()

avg_abandonment_time()

Create data based on number of requests

Correcting the calculations and predition

Read the USER_MANUAL to understand how to use the program.

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

`generate_data()`

`avg_response_time()`

`avg_abandonment_time()`

Packages