Group Member:
Serhat Beyaz KÖROĞLU
Zeynep Beyza BÖREKÇİ
Büşra GÖRER
• Project Description
• Project goal & social problem addressed
• Project data
• Actions, Results and Discussion,Conclusion
• url address of the project
• References
• Appendix
We have 2 different behaviors: Students’ Choices and Employers’ Choices
First of all, we discussed these two behaviors and evaluated them separately.
Important Note: Since Kariyet.net makes this evaluation in the light of certain limitations, we have limited the students’ preferences in certain standards in order to ensure the consistency of the data. In addition to these, we made additional limitations to make the study meaningful.
We obtained the employer preferences from the data made by Kariyer.net. The “Employer Interest Index” published by Turkey’s leading employment platform Kariyer.net in 2020 sheds light on this problem. In this study, Kariyer.net analyzed 200 million transactions of 486,000 employer users who looking for new employees in the last 10 years. It then tried to make sense of these movements by matching them with the information on the candidates’ resumes. It determined the rate of employment of graduates in the first 2 years following university graduation, and which university or department graduates were more frequently interested in by employers.
1.1. Employer Interest Index
The standards that Kariyet.net takes into account when determining the index:
• Certain criteria were determined for university graduates who are among Kariyer.net members, and the data of 2 million people who fulfill these criteria were analyzed in the study.
• In the study, 200 million activities of 88 thousand employer users out of 486 thousand employers who were members of Kariyer.net and used Kariyer.net between 2009-2019 were examined.
Excluded from the data considered in the study:
• Within the scope of the study, the university and department from which the graduate candidates graduated at the undergraduate level,
• The first section that double major candidates add to their career.net resume,
• The most up-to-date CVs of candidates with more than one CV were taken into account.
Groups not included in the study:
• Candidates with open education and associate degree degrees: In order to make comparisons and results more reliable, open and distance education graduates with education levels other than undergraduate are excluded. • Candidates who do not have a graduation date on their resume: Candidates who do not include this information in their profile, as the absence of a graduation date will mislead the calculations of the relevant semester, are excluded from the study.
• Candidates who graduated from the departments of Medicine and Dentistry: In the study, employment status within 2 years following university graduation was examined. Since the students who graduated from the departments of medicine followed a different path by taking the specialty exams within 2 years after graduation, these occupational groups were excluded.
• Candidates who graduated from closed universities and departments: Since this study aims to provide information about the current situation to the individuals who will enter the university, the data of the candidates who graduated from the closed universities and departments are excluded.
• Candidates who graduated from universities abroad: Since the study was conducted for individuals who will enter universities in Turkey, candidates who graduated from universities abroad were excluded.
• Candidates who graduated from departments or universities with less than 60 graduates: These universities and departments were excluded in case the universities and/or departments with a small number of graduates could affect the results dramatically.
1.2. Outcomes of Employer Preferences
Location: Employers prefer students from universities in their region.
Foreign Language: In addition to English as a foreign language, they prefer universities that provide education in French and German.
They prefer well-established universities.
Important Note: Since Kariyet.net makes this evaluation in the light of certain limitations, we have limited the students’ preferences in certain standards in order to ensure the consistency of the data. In addition to these, we made additional limitations for the study to be meaningful.
We evaluated the university preferences of the students. In the light of these data, we created the “Student Interest Index”.
This is how we did it:
• We pulled the data of the university preference results of the students from the
• We divided it into universities one by one, and determined the student interest index of each university.
• We divided it into sections one by one, and determined the student interest index of each section.
2.1. Student Interest Index
Groups not included in the study:
• Candidates with open education and associate degree degrees: In order to make comparisons and results more reliable, open and distance education graduates with education levels other than undergraduate are excluded.
• Students who prefer Medicine and Dentistry departments.
• Interest index of departments: Since most popular departments are in universities, we have taken into account the top 25 rankings of that department in order not to decrease the index average and to get a more accurate value.
In addition to these, in order to obtain meaningful data:
• We have identified 23 universities that are at the forefront
• We have identified 15 departments that are at the forefront
We evaluated the data on this criteria.
2.2. Outcomes of Student Preferences
Location: Students choose their location between east-west options. They prefer universities in the West and South.
Foreign Language: English foreign language education has a high importance in student preferences. They also prefer universities that offer education in French and German.
They prefer well-established and prestigious universities.
By comparing Employer Preferences and Student preferences, similarities and differences are revealed. By comparing the two data, it is aimed to help students make more effective university choices.
3.1. Outcomes of Comparison of Preferences
• Location: The most important factor is the preference of employers and the preference of students in the regions closer to them. It was determined that the students did not make a choice in the light of this information in their preferences. Especially since Istanbul hosts 28.15% of the companies in Turkey, it is a great advantage to study at a university in this city.
Workplace Rates:
1- Istanbul: 28.15
2- Ankara: 7.74
3- Izmir: 7.00
4- Bursa: 4.16
5- Antalya: 3.86
6- Izmit: 2.49
• Foreign Language: We said that employers are looking for a foreign language. Students prefer foreign language education at least as much as employers. In addition to English, French and German languages also bring students to the fore.
• Well-established and prestigious Universities are preferred by both parties.
library(readr)
library(stringr)
## Warning: package 'stringr' was built under R version 4.0.5
library(readxl)
library(tidyverse)
## -- Attaching packages --------------------------------------- tidyverse 1.3.0 --
## v ggplot2 3.3.3 v purrr 0.3.4
## v tibble 3.1.0 v dplyr 1.0.5
## v tidyr 1.1.3 v forcats 0.5.1
## -- Conflicts ------------------------------------------ tidyverse_conflicts() --
## x dplyr::filter() masks stats::filter()
## x dplyr::lag() masks stats::lag()
library(ggplot2)
library(dplyr)
library(gapminder)
## Warning: package 'gapminder' was built under R version 4.0.5
library(hrbrthemes)
## Warning: package 'hrbrthemes' was built under R version 4.0.5
## NOTE: Either Arial Narrow or Roboto Condensed fonts are required to use these themes.
## Please use hrbrthemes::import_roboto_condensed() to install Roboto Condensed and
## if Arial Narrow is not on your system, please see https://bit.ly/arialnarrow
library(viridis)
## Warning: package 'viridis' was built under R version 4.0.5
## Loading required package: viridisLite
library(plotrix)
library(writexl)
## Warning: package 'writexl' was built under R version 4.0.5
OSYMsonuc <- read_excel("database/diger/osym_sonuc.xlsx") #Veriyi okuttum.
## Warning in read_fun(path = enc2native(normalizePath(path)), sheet_i = sheet, :
## Expecting numeric in L3078 / R3078C12: got '1-150'
## New names:
## * `` -> ...8
## * `` -> ...13
OSYMsonuc <- data.frame(OSYMsonuc$Üniversite, OSYMsonuc$`Fakülte veya Bölüm`, OSYMsonuc$Şehir, OSYMsonuc$`En Düşük Puan`, OSYMsonuc$`Kontenjan ve Geçen Yıldan Farkı`, OSYMsonuc$`Puan Türü`) #işime yarayacak sütunları aldım
#OSYMsonuc <- OSYMsonuc[!(OSYMsonuc$OSYMsonuc..Puan.Türü.=="DİL" | OSYMsonuc$OSYMsonuc..Puan.Türü.=="SÖZ" | OSYMsonuc$OSYMsonuc..Puan.Türü.=="EA"),] #DİL ve SÖZ puan türlerini temizledim.
OSYMsonuc <- OSYMsonuc[(OSYMsonuc$OSYMsonuc..Kontenjan.ve.Geçen.Yıldan.Farkı.>=30),] #Kontenjan 60 üstü olanları seçiyorum.
OSYMsonuc <- OSYMsonuc[!grepl("Tıp", OSYMsonuc$OSYMsonuc..Fakülte.veya.Bölüm.),] #Tıp bölümleri çıkarıldı.
OSYMsonuc <- OSYMsonuc[!grepl("Diş", OSYMsonuc$OSYMsonuc..Fakülte.veya.Bölüm.),] #Diş Bölümleri çıkarıldı.
OSYMsonuc <- OSYMsonuc[!grepl("Eczacılık", OSYMsonuc$OSYMsonuc..Fakülte.veya.Bölüm.),]#Diş Bölümleri çıkarıldı.
OSYMsonuc <- OSYMsonuc[!grepl("Hemşirelik", OSYMsonuc$OSYMsonuc..Fakülte.veya.Bölüm.),]
OSYMsonuc <- OSYMsonuc[!grepl("(İÖ)", OSYMsonuc$OSYMsonuc..Fakülte.veya.Bölüm.),] #İkinci öğretim (İÖ) Bölümleri çıkarıldı.
OSYMsonuc <- OSYMsonuc[!grepl("(Açıköğretim)", OSYMsonuc$OSYMsonuc..Fakülte.veya.Bölüm.),] #Açıköğretim bölümleri çıkarıldı.
#ÖSYM Verisi, Kariyer.net'in verileriyle karşılaştırma yapabilmek için temizlendi. Benzer kriterler uygulandı.
OSYMsonuc <- OSYMsonuc[order( OSYMsonuc[,4], decreasing= TRUE ), ] # puana göre sıraladım.
#1
OSYMsonuc$OSYMsonuc..Fakülte.veya.Bölüm. <- chartr("ş ç ı ğ ü ö Ş Ç İ Ğ Ü Ö", "s c i g u o S C I G U O", OSYMsonuc$OSYMsonuc..Fakülte.veya.Bölüm. ) #Bunu 2 defa yapmış gibi gözüküyorum ama zorunda kaldım,çünkü stopwords dosyasındki türkçe karakterler bozuk çıkıyordu.
#2
library(tm)
## Warning: package 'tm' was built under R version 4.0.5
## Loading required package: NLP
##
## Attaching package: 'NLP'
## The following object is masked from 'package:ggplot2':
##
## annotate
stopwords = readLines('stopwords.txt')
OSYMsonuc$OSYMsonuc..Fakülte.veya.Bölüm. = removeWords(OSYMsonuc$OSYMsonuc..Fakülte.veya.Bölüm., stopwords)
#2.1
OSYMsonuc$OSYMsonuc..Fakülte.veya.Bölüm. <- gsub("[()]", " ", OSYMsonuc$OSYMsonuc..Fakülte.veya.Bölüm.)
OSYMsonuc$OSYMsonuc..Fakülte.veya.Bölüm. <- gsub(" ", "", OSYMsonuc$OSYMsonuc..Fakülte.veya.Bölüm.)
OSYMsonuc$OSYMsonuc..Fakülte.veya.Bölüm. <- gsub(" - ", "-", OSYMsonuc$OSYMsonuc..Fakülte.veya.Bölüm.)
#3
OSYMsonuc <- unite(data=OSYMsonuc,col="uni-dept","OSYMsonuc.Üniversite","OSYMsonuc..Fakülte.veya.Bölüm.",sep=" ")
#4 - birleştirdikten sonra 2 boşluk yanyana gelebilir!
OSYMsonuc$`uni-dept` <- gsub(" ", " ", OSYMsonuc$`uni-dept`)
#5
OSYMsonuc$`uni-dept` <- chartr("ş ç ı ğ ü ö Ş Ç İ Ğ Ü Ö", "s c i g u o S C I G U O", OSYMsonuc$`uni-dept`)
OSYMsonuc$`uni-dept` <- toupper(OSYMsonuc$`uni-dept`)
OSYMsonuc$`uni-dept` <- gsub(" - ", "-", OSYMsonuc$`uni-dept`)
kariyer <- read_excel("database/diger/univebolum.xlsx") #veriyi okuttum
#Veri tek sütun içerisindeydi, 3 farklı veri var, 3 yeni sütuna ayırdım.
uni_name <- kariyer %>% filter(row_number() %% 4 == 1)
bolum_name <- kariyer %>% filter(row_number() %% 4 == 2)
endeks <- kariyer %>% filter(row_number() %% 4 == 3)
kariyer <- data.frame(uni_name,bolum_name, endeks) # 3 sütunu bir araya getirdim
rm(uni_name) #işi biten veriyi kaldırdım
rm(bolum_name)#işi biten veriyi kaldırdım
rm(endeks)#işi biten veriyi kaldırdım
#1 - parantez içlerinde türkçe karakter var, stopwords.txt türkçe karakteri kaldırmıyor, o yüzden bunu düzeltiyoruz.
kariyer$uni <- chartr("ş ç ı ğ ü ö Ş Ç İ Ğ Ü Ö", "s c i g u o S C I G U O", kariyer$uni)
kariyer$uni.1 <- chartr("ş ç ı ğ ü ö Ş Ç İ Ğ Ü Ö", "s c i g u o S C I G U O", kariyer$uni.1)
#2
stopwords = readLines('stopwords.txt')
kariyer$uni = removeWords(kariyer$uni, stopwords)
kariyer$uni.1 = removeWords(kariyer$uni.1, stopwords)
#2.1
kariyer$uni <- gsub("[()]", " ", kariyer$uni)
kariyer$uni <- gsub(" ", "", kariyer$uni)
kariyer$uni <- gsub(" - ", "-", kariyer$uni)
kariyer$uni.1 <- gsub("[()]", " ", kariyer$uni.1)
kariyer$uni.1 <- gsub(" ", "", kariyer$uni.1)
kariyer$uni.1 <- gsub(" - ", "-", kariyer$uni.1)
#3
kariyer$uni <- toupper(kariyer$uni)
kariyer$uni.1 <- toupper(kariyer$uni.1)
#4
kariyer <- unite(data=kariyer,col="uni-dept","uni","uni.1",sep=" ")
#5 - birleştirdikten sonra 2 boşluk yanyana gelebilir!
kariyer$`uni-dept` <- gsub(" ", " ", kariyer$`uni-dept`)
hepsi <- merge(OSYMsonuc, kariyer, by= 'uni-dept')
#hepsi0 <- count(hepsi0, `uni-dept`) - 1den fazla olanları görüntüledim.
hepsi<- hepsi[-c(753, 1392, 1012, 1418, 1151, 1031, 1037, 773, 751, 854, 858, 860, 1406, 1032, 1038, 1400, 1028, 780, 777, 765, 761, 759, 757, 748, 747 ),] #bazılarından 2 kapya yaptı, üst sıralardakileri ellerimle büyük olanı aldım, gerisini fonk. bıraktım.
hepsi <- distinct(hepsi, `uni-dept`, .keep_all= TRUE) # çift olanları teke düşürüyor.
names(hepsi)[2] <- 'city'
names(hepsi)[3] <- 'puan'
names(hepsi)[4] <- 'quota'
names(hepsi)[5] <- 'type_of_puan'
names(hepsi)[6] <- 'e.i.i'
hepsi$e.i.i <- as.numeric(hepsi$e.i.i)
hepsi$s.i.i <- as.numeric(hepsi[,3]/6.57)
hepsi$s.i.i <- round(hepsi$s.i.i, digits = 1)
hepsi <- data.frame( hepsi[,2], hepsi[,1], hepsi[,3], hepsi[,7], hepsi[,6], hepsi[,5] )
names(hepsi)[1] <- 'city'
names(hepsi)[2] <- 'uni_dept'
names(hepsi)[3] <- 'puan'
names(hepsi)[4] <- 's.i.i'
names(hepsi)[5] <- 'e.i.i'
names(hepsi)[6] <- 'type_of_puan'
#write_xlsx(hepsi, "database\\ilgi\\final.xlsx")
library(sf)
## Warning: package 'sf' was built under R version 4.0.5
## Linking to GEOS 3.9.0, GDAL 3.2.1, PROJ 7.2.1
turkeyGeo <- readRDS("database/tr/gadm36_TUR_1_sf.rds")
turkeyGeo[c(1,2,3,5,6,7,8,9,10)] <- NULL
turkeyGeo$NAME_1 <- chartr("ş ç ı ğ ü ö Ş Ç İ Ğ Ü Ö", "s c i g u o S C I G U O", turkeyGeo$NAME_1)
turkeyGeo$NAME_1 <- toupper(turkeyGeo$NAME_1)
turkeyGeo[42,1] <- 'KAHRAMANMARAS'
turkeyGeo[49,1] <- 'KIRIKKALE'
turkeyGeo[81,1] <- 'ZONGULDAK' #My lovely city where I was born in
names(turkeyGeo)[1] <- 'CityName'
#install.packages("rvest")
library(rvest)
## Warning: package 'rvest' was built under R version 4.0.5
##
## Attaching package: 'rvest'
## The following object is masked from 'package:readr':
##
## guess_encoding
# İllere göre üniversite sayısı için veriyi çektim.
il_uni <- read_html("https://blog.youthall.com/turkiyedeki-universiteler-listesi/")
il_uni <- il_uni %>%
html_nodes("table") %>%
html_table()
il_uni <- as.data.frame(il_uni)
# Her ilde kaç üniversite var count ile saydım.
library(dplyr)
il_uni <- count(il_uni, Şehir)
il_uni$Şehir <- chartr("ş ç ı ğ ü ö Ş Ç İ Ğ Ü Ö Â", "s c i g u o S C I G U O A", il_uni$Şehir)
il_uni$Şehir <- toupper(il_uni$Şehir)
il_uni[3,1] <- 'AFYON'
names(il_uni)[1] <- 'CityName'
il_uni <- merge(turkeyGeo, il_uni, by= 'CityName', all.x= TRUE)
rm(turkeyGeo) #çevremizi temiz tutalım
# install.packages("leaflet")
library(leaflet)
## Warning: package 'leaflet' was built under R version 4.0.5
st_crs(il_uni) # User input: EPSG:4326 -> Good!
## Coordinate Reference System:
## User input: EPSG:4326
## wkt:
## GEOGCRS["WGS 84",
## DATUM["World Geodetic System 1984",
## ELLIPSOID["WGS 84",6378137,298.257223563,
## LENGTHUNIT["metre",1]]],
## PRIMEM["Greenwich",0,
## ANGLEUNIT["degree",0.0174532925199433]],
## CS[ellipsoidal,2],
## AXIS["geodetic latitude (Lat)",north,
## ORDER[1],
## ANGLEUNIT["degree",0.0174532925199433]],
## AXIS["geodetic longitude (Lon)",east,
## ORDER[2],
## ANGLEUNIT["degree",0.0174532925199433]],
## USAGE[
## SCOPE["Horizontal component of 3D system."],
## AREA["World."],
## BBOX[-90,-180,90,180]],
## ID["EPSG",4326]]
# il_uni <- st_transform(il_uni, crs = 4326) (Gerek yok)
plot(st_geometry(il_uni))
library(dplyr)
library(ggplot2)
library(viridis)
il_uni %>%
ggplot() +
geom_sf(aes(fill = n), color = "white") +
theme_void() +
ggtitle("İllere Göre Üniversite Yoğunluğu") +
theme(title = element_text(face="bold"))
library(readxl)
final <- read_excel("database/ilgi/final.xlsx")
hepsi <- final
# library
library(ggplot2)
library(ggExtra)
## Warning: package 'ggExtra' was built under R version 4.0.5
# The mtcars dataset is proposed in R
head(hepsi)
## # A tibble: 6 x 6
## city uni_dept puan s.i.i e.i.i type_of_puan
## <chr> <chr> <dbl> <dbl> <dbl> <chr>
## 1 AFYONKARAH~ AFYON KOCATEPE UNIVERSITESI ELEKTR~ 284. 43.2 18.9 SAY
## 2 AFYONKARAH~ AFYON KOCATEPE UNIVERSITESI GASTRO~ 349. 53.1 18.4 SÖZ
## 3 AFYONKARAH~ AFYON KOCATEPE UNIVERSITESI GIDA M~ 294. 44.7 18.4 SAY
## 4 AFYONKARAH~ AFYON KOCATEPE UNIVERSITESI ISLETME 236. 36 17.4 EA
## 5 AFYONKARAH~ AFYON KOCATEPE UNIVERSITESI LOJIST~ 221. 33.7 18.2 EA
## 6 AFYONKARAH~ AFYON KOCATEPE UNIVERSITESI MAKINE~ 288. 43.8 21.1 SAY
# classic plot :
p <- ggplot(hepsi, aes(x=e.i.i, y=s.i.i, color=uni_dept, size=puan)) +
geom_point() +
ylab("Student Interest Index") +
xlab("Employer Interest Index") +
theme(legend.position="none")
plot(p)
# Set relative size of marginal plots (main plot 10x bigger than marginals)
p1 <- ggMarginal(p, type="histogram", size=10)
plot(p1)
# Custom marginal plots:
p2 <- ggMarginal(p, type="histogram", fill = "8d744b", xparams = list( bins=10))
plot(p2)
# Show only marginal plot for x axis
p3 <- ggMarginal(p, margins = 'x', color="purple", size=4)
plot(p3)
Finally, and most importantly, we compared the “Employer Interest Index” with the “Student Interest Index”. We have listed the departments and universities in order to show a more efficient university and department preference.
ankara <- read_excel("database/uni/ankara.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
ankara <- ankara[-c(11,13,18,20,28:30,58,59,104,105,122:126),c(3,5,10)]
ogrenci_ankara <- mean(ankara$`En Düşük Puan`)
ilgi_ankara<- c("ANKARA UNIVERSITESI", ogrenci_ankara)
bahcesehir<- read_excel("database/uni/bahcesehir.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
bahcesehir<- bahcesehir[,c(3,5,10)]
ogrenci_bahcesehir<- mean((bahcesehir$`En Düşük Puan`))
ilgi_bahcesehir<- c("BAHCESEHIR UNIVERSITESI", ogrenci_bahcesehir)
ilgi_uni<- rbind(ilgi_ankara, ilgi_bahcesehir)
bilgi<- read_excel("database/uni/bilgi.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
bilgi<- bilgi[,c(3,5,10)]
ogrenci_bilgi<-mean(bilgi$`En Düşük Puan`)
ilgi_bilgi<- c("ISTANBUL BILGI UNIVERSITESI", ogrenci_bilgi)
bilkent <- read_excel("database/uni/bilkent.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
bilkent<- bilkent[,c(3,5,10)]
ogrenci_bilkent<- mean(bilkent$`En Düşük Puan`)
ilgi_bilkent<- c("IHSAN DOGRAMACI BILKENT UNIVERSITESI", ogrenci_bilkent)
bogazici <- read_excel("database/uni/bogazici.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
bogazici<- bogazici[-c(30,31,36),c(3,5,10)]
ogrenci_bogazici<- mean(bogazici$`En Düşük Puan`)
ilgi_bogazici<- c("BOGAZICI UNIVERSITESI", ogrenci_bogazici)
dokuzeylul <- read_excel("database/uni/dokuzeylul.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
dokuzeylul<- dokuzeylul[-c(8,27:30,34,36,38:40,43,44,53,62,97,96,98:100),c(3,5,10)]
ogrenci_dokuzeylul<- mean(dokuzeylul$`En Düşük Puan`)
ilgi_dokuzeylul<- c("DOKUZ EYLUL UNIVERSITESI", ogrenci_dokuzeylul)
ege <- read_excel("database/uni/ege.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
ege<- ege[-c(12,16,20,36,44,61:64),c(3,5,10)]
ogrenci_ege<- mean(ege$`En Düşük Puan`)
ilgi_ege<- c("EGE UNIVERSITESI", ogrenci_ege)
galatasaray <- read_excel("database/uni/galatasaray.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
galatasaray<- galatasaray[,c(3,5,10)]
ogrenci_galatasaray<- mean(galatasaray$`En Düşük Puan`)
ilgi_galatasaray<- c("GALATASARAY UNIVERSITESI", ogrenci_galatasaray)
gazi <- read_excel("database/uni/gazi.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
gazi<- gazi[-c(2,14,38,42,52:56,61,63,64),c(3,5,10)]
ogrenci_gazi<- mean(gazi$`En Düşük Puan`)
ilgi_gazi<- c("GAZI UNIVERSITESI", ogrenci_gazi)
gyte <- read_excel("database/uni/gyte.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
gyte<- gyte[,c(3,5,10)]
ogrenci_gyte<- mean(gyte$`En Düşük Puan`)
ilgi_gyte<- c("GEBZE TEKNIK UNIVERSITESI", ogrenci_gyte)
hacettepe <- read_excel("database/uni/hacettepe.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
hacettepe<- hacettepe[-c(12,59,71:73),c(3,5,10)]
ogrenci_hacettepe<- mean(hacettepe$`En Düşük Puan`)
ilgi_hacettepe<- c("HACETTEPE UNIVERSITESI", ogrenci_hacettepe)
itu <- read_excel("database/uni/itu.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
itu<- itu[-c(2,3,43,48,52:63),c(3,5,10)]
ogrenci_itu<- mean(itu$`En Düşük Puan`)
ilgi_itu<- c("ISTANBUL TEKNIK UNIVERSITESI", ogrenci_itu)
iu <- read_excel("database/uni/iu.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
iu<- iu[-c(23,24,26,53:68,71,85,93,99:103,104,141:144),c(3,5,10)]
ogrenci_iu<- 363.2045
ilgi_iu<- c("ISTANBUL UNIVERSITESI", ogrenci_iu )
85,93,99:103,104,141:144)
iyte <- read_excel("database/uni/iyte.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
iyte<- iyte[-c(17,20),c(3,5,10)]
ogrenci_iyte<- mean(iyte$`En Düşük Puan`)
ilgi_iyte<- c("IZMIR YUKSEK TEKNOLOJI ENSTITUSU", ogrenci_iyte)
koc <- read_excel("database/uni/koc.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
koc<- koc[,c(3,5,10)]
ogrenci_koc<- mean(koc$`En Düşük Puan`)
ilgi_koc<- c("KOC UNIVERSITESI", ogrenci_koc)
kocaeli <- read_excel("database/uni/kocaeli.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
kocaeli<- kocaeli[-c(8,14,18,19,21,23,24,26,27,30,34,44,47,57,59,62,66,67,68,80,81),c(3,5,10)]
ogrenci_kocaeli<- mean(kocaeli$`En Düşük Puan`)
ilgi_kocaeli<- c("KOCAELI UNIVERSITESI",ogrenci_kocaeli )
marmara <- read_excel("database/uni/marmara.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
marmara<- marmara[-c(9,25,26,36:38,63,66,67,69,70,79,81,91),c(3,5,10)]
ogrenci_marmara<- mean(marmara$`En Düşük Puan`)
ilgi_marmara<- c("MARMARA UNIVERSITESI", ogrenci_marmara)
odtu <- read_excel("database/uni/odtu.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
odtu<- odtu[-c(2,3,11:19,36,49:56),c(3,5,10)]
ogrenci_odtu<- mean(odtu$`En Düşük Puan`)
ilgi_odtu<- c("ORTA DOGU TEKNIK UNIVERSITESI", ogrenci_odtu)
ozyegin <- read_excel("database/uni/ozyegin.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
ozyegin<- ozyegin[,c(3,5,10)]
ogrenci_ozyegin<- mean(ozyegin$`En Düşük Puan`)
ilgi_ozyegin<- c("OZYEGIN UNIVERSITESI", ogrenci_ozyegin )
sabanci <- read_excel("database/uni/sabanci.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
sabanci <- sabanci[,c(3,5,10)]
ogrenci_sabanci <-mean(sabanci$`En Düşük Puan`)
ilgi_sabanci <- c("SABANCI UNIVERSITESI", ogrenci_sabanci)
turkalman <- read_excel("database/uni/turkalman.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
turkalman<- turkalman[,c(3,5,10)]
ogrenci_turkalman <-mean(turkalman$`En Düşük Puan`)
ilgi_turkalman<- c("TURK-ALMAN UNIVERSITESI", ogrenci_turkalman)
uludag <- read_excel("database/uni/uludag.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
uludag <- uludag[-c(15, 16, 22, 23, 25, 26, 29, 32, 40, 42, 61, 68, 71, 75, 76),c(3,5,10)]
ogrenci_uludag <- mean(uludag$`En Düşük Puan`)
ilgi_uludag<- c("ULUDAG UNIVERSITESI", ogrenci_uludag)
yildiz <- read_excel("database/uni/yildiz.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
yildiz<- yildiz[-c(35, 51, 52),c(3,5,10)]
ogrenci_yildiz <- mean(yildiz$`En Düşük Puan`)
ilgi_yildiz <- c("YILDIZ TEKNIK UNIVERSITESI", ogrenci_yildiz)
isveren_ilgi_uni <- read_excel("database/ilgi/isveren_ilgi_uni.xlsx")
colnames(isveren_ilgi_uni)<- c("Universiteler", "Isveren.Ilgi.Endeksi")
ilgi_uni<- rbind(ilgi_ankara, ilgi_bahcesehir, ilgi_bilgi, ilgi_bilkent, ilgi_bogazici, ilgi_dokuzeylul, ilgi_ege, ilgi_galatasaray, ilgi_gazi, ilgi_gyte, ilgi_hacettepe, ilgi_itu, ilgi_iu, ilgi_iyte, ilgi_koc, ilgi_kocaeli, ilgi_marmara, ilgi_odtu, ilgi_ozyegin, ilgi_sabanci, ilgi_turkalman, ilgi_uludag, ilgi_yildiz)
colnames(ilgi_uni)<- c("Universiteler", "Ogrenci.Ilgi.Endeksi")
ilgi_uni<- merge(ilgi_uni, isveren_ilgi_uni, by='Universiteler')
ilgi_uni$Universiteler <- as.character(ilgi_uni$Universiteler)
ilgi_uni$Ogrenci.Ilgi.Endeksi <- as.numeric(ilgi_uni$Ogrenci.Ilgi.Endeksi)
ilgi_uni$Ogrenci.Ilgi.Endeksi <- round(ilgi_uni$Ogrenci.Ilgi.Endeksi, digits = 1)
ilgi_uni$s.i.i <- as.character(ilgi_uni[,2]/5)
ilgi_uni$s.i.i.num <- as.numeric(ilgi_uni[,2]/5)
ilgi_uni$Isveren.Ilgi.Endeksi <- as.numeric(ilgi_uni$Isveren.Ilgi.Endeksi)
ilgi_uni$Isveren.Ilgi.Endeksi <- round(ilgi_uni$Isveren.Ilgi.Endeksi, digits = 1)
ilgi_uni$e.i.i <- as.character(ilgi_uni[,3])
ilgi_uni$e.i.i.num <- as.numeric(ilgi_uni[,3])
write_xlsx(ilgi_uni, "database\\ilgi\\ilgi_uni.xlsx")
ilgi_uni %>%
group_by(Universiteler) %>%
ungroup() %>%
ggplot(aes(x=s.i.i.num, y=reorder(Universiteler, s.i.i.num) ,fill=s.i.i)) +
ylab("Universities") +
xlab("Student Interest Index") +
geom_col() +
scale_fill_viridis_d() +
guides(fill = guide_legend(reverse = TRUE) ) +
theme(
panel.grid = element_blank(),
panel.background = element_blank())
ilgi_uni %>%
group_by(Universiteler) %>%
ungroup() %>%
ggplot(aes(x=e.i.i.num, y=reorder(Universiteler, e.i.i.num) ,fill=e.i.i)) +
ylab("Universities") +
xlab("Employer Interest Index") +
geom_col() +
scale_fill_viridis_d() +
guides(fill = guide_legend(reverse = TRUE) ) +
theme(
panel.grid = element_blank(),
panel.background = element_blank())
ilgi_uni %>% group_by(Universiteler) %>% ungroup() %>% ggplot(aes(x=e.i.i.num, y=reorder(Universiteler, e.i.i.num) ,fill=e.i.i)) + ylab(“Universities”) + xlab(“Employer Interest Index”) + geom_col() + scale_fill_viridis_d() + guides(fill = guide_legend(reverse = TRUE) ) + theme( panel.grid = element_blank(), panel.background = element_blank())
bilgisayar<- read_excel("database/bolum/bilgisayar.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
isveren_ilgi_bolum <- read_excel("database/ilgi/isveren_ilgi_bolum.xlsx")
bilgisayar<- bilgisayar[c(1:25),]
ogrenci_bilgisayar<- mean((bilgisayar$`En Düşük Puan`))
ilgi_bilgisayar<- c("BILGISAYAR MUHENDISLIGI", ogrenci_bilgisayar, isveren_ilgi_bolum[13,2])
ekonomi<- read_excel("database/bolum/ekonomi.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
ekonomi<- ekonomi[c(1:25),]
ogrenci_ekonomi<- mean((ekonomi$`En Düşük Puan`))
ilgi_ekonomi<- c("EKONOMI", ogrenci_ekonomi, isveren_ilgi_bolum[19,2])
elektronik<- read_excel("database/bolum/elektronik.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
elektronik<- elektronik[c(1:25),]
ogrenci_elektronik<- mean((elektronik$`En Düşük Puan`))
ilgi_elektronik<- c("ELEKTRIK ELEKTRONIK MUHENDISLIGI", ogrenci_elektronik, isveren_ilgi_bolum[26,2])
endustri<- read_excel("database/bolum/endustri.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
endustri<- endustri[c(1:25),]
ogrenci_endustri<- mean((endustri$`En Düşük Puan`))
ilgi_endustri<- c("ENDUSTRI MUHENDISLIGI", ogrenci_endustri, isveren_ilgi_bolum[9,2])
fizik<- read_excel("database/bolum/fizik.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
fizik<- fizik[c(1:25),]
ogrenci_fizik<- mean((fizik$`En Düşük Puan`))
ilgi_fizik<- c("FIZIK MUHENDISLIGI", ogrenci_fizik, isveren_ilgi_bolum[172,2])
hukuk<- read_excel("database/bolum/hukuk.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
hukuk<- hukuk[c(1:25),]
ogrenci_hukuk<- mean((hukuk$`En Düşük Puan`))
ilgi_hukuk<- c("HUKUK", ogrenci_hukuk, isveren_ilgi_bolum[221,2])
insaat<- read_excel("database/bolum/insaat.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
insaat<- insaat[c(1:25),]
ogrenci_insaat<- mean((insaat$`En Düşük Puan`))
ilgi_insaat<- c("INSAAT MUHENDISLIGI", ogrenci_insaat, isveren_ilgi_bolum[205,2])
isletme<- read_excel("database/bolum/isletme.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
isletme<- isletme[c(1:25),]
ogrenci_isletme<- mean((isletme$`En Düşük Puan`))
ilgi_isletme<- c("ISLETME", ogrenci_isletme, isveren_ilgi_bolum[24,2])
kimya<- read_excel("database/bolum/kimya.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
kimya<- kimya[c(1:25),]
ogrenci_kimya<- mean((kimya$`En Düşük Puan`))
ilgi_kimya<- c("KIMYA", ogrenci_kimya, isveren_ilgi_bolum[118,2])
kimyamuh<- read_excel("database/bolum/kimyamuh.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
kimyamuh<- kimyamuh[c(1:25),]
ogrenci_kimyamuh<- mean((kimyamuh$`En Düşük Puan`))
ilgi_kimyamuh<- c("KIMYA MUHENDISLIGI", ogrenci_kimyamuh, isveren_ilgi_bolum[32,2])
makine<- read_excel("database/bolum/makine.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
makine<- makine[c(1:25),]
ogrenci_makine<- mean((makine$`En Düşük Puan`))
ilgi_makine<- c("MAKINE MUHENDISLIGI", ogrenci_makine, isveren_ilgi_bolum[66,2])
matematik<- read_excel("database/bolum/matematik.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
matematik<- matematik[c(1:25),]
ogrenci_matematik<- mean((matematik$`En Düşük Puan`))
ilgi_matematik<- c("MATEMATIK", ogrenci_matematik, isveren_ilgi_bolum[213,2])
mbg<- read_excel("database/bolum/mbg.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
mbg<- mbg[c(1:25),]
ogrenci_mbg<- mean((mbg$`En Düşük Puan`))
ilgi_mbg<- c("MOLEKULER BIYOLOJI VE GENETIK", ogrenci_mbg, isveren_ilgi_bolum[246,2])
mimarlik<- read_excel("database/bolum/mimarlik.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
mimarlik<- mimarlik[c(1:25),]
ogrenci_mimarlik<- mean((mimarlik$`En Düşük Puan`))
ilgi_mimarlik<- c("MIMARLIK", ogrenci_mimarlik, isveren_ilgi_bolum[173,2])
psikoloji<- read_excel("database/bolum/psikoloji.xlsx")
## New names:
## * `` -> ...8
## * `` -> ...13
psikoloji<- psikoloji[c(1:25),]
ogrenci_psikoloji<- mean((psikoloji$`En Düşük Puan`))
ilgi_psikoloji<- c("PSIKOLOJI", ogrenci_psikoloji, isveren_ilgi_bolum[220,2])
ilgi_bolum <- c( ilgi_bilgisayar, ilgi_elektronik, ilgi_endustri, ilgi_makine, ilgi_hukuk, ilgi_matematik, ilgi_mimarlik, ilgi_kimya, ilgi_psikoloji, ilgi_mbg, ilgi_isletme, ilgi_kimyamuh, ilgi_insaat, ilgi_fizik, ilgi_ekonomi)
ilgi_bolum<-matrix(ilgi_bolum, nrow=15, byrow=TRUE)
colnames(ilgi_bolum)<- c("Bolumler","Ogrenci.Ilgi.Endeksi","Isveren.Ilgi.Endeksi")
ilgi_bolum <- data.frame(ilgi_bolum)
ilgi_bolum$Bolumler <- as.character(ilgi_bolum$Bolumler)
ilgi_bolum$Ogrenci.Ilgi.Endeksi <- as.numeric(ilgi_bolum$Ogrenci.Ilgi.Endeksi)
ilgi_bolum$Ogrenci.Ilgi.Endeksi <- round(ilgi_bolum$Ogrenci.Ilgi.Endeksi, digits = 1)
ilgi_bolum$s.i.i <- as.character(ilgi_bolum[,2]*9/50)
ilgi_bolum$s.i.i.num <- as.numeric(ilgi_bolum[,2]*9/50)
ilgi_bolum$Isveren.Ilgi.Endeksi <- as.numeric(ilgi_bolum$Isveren.Ilgi.Endeksi)
ilgi_bolum$Isveren.Ilgi.Endeksi <- round(ilgi_bolum$Isveren.Ilgi.Endeksi, digits = 1)
ilgi_bolum$e.i.i <- as.character(ilgi_bolum[,3])
ilgi_bolum$e.i.i.num <- as.numeric(ilgi_bolum[,3])
ilgi_bolum %>%
group_by(Bolumler) %>%
ungroup() %>%
ggplot(aes(x=s.i.i.num, y=reorder(Bolumler, s.i.i.num) ,fill=s.i.i)) +
ylab("Departments") +
xlab("Student Interest Index") +
geom_col() +
scale_fill_viridis_d() +
guides(fill = guide_legend(reverse = TRUE) ) +
theme(
panel.grid = element_blank(),
panel.background = element_blank())
ilgi_bolum %>%
group_by(Bolumler) %>%
ungroup() %>%
ggplot(aes(x=e.i.i.num, y=reorder(Bolumler, e.i.i.num) ,fill=e.i.i)) +
ylab("Departments") +
xlab("Employer Interest Index") +
geom_col() +
scale_fill_viridis_d() +
guides(fill = guide_legend(reverse = TRUE) ) +
theme(
panel.grid = element_blank(),
panel.background = element_blank())
write_xlsx(ilgi_bolum, "database\\ilgi\\ilgi_bolum.xlsx")