New feature added by Facebook, Facebook photos and videos can be moved to Google Photos

Facebook, the most widely used social network in the world, has added a new feature.

Facebook, a new feature launched in the US and Canada last April, is now open to the world.

Facebook has made this service global by improving it based on comments and suggestions from users in the US and Canada.

According to the new feature, photos and videos posted on Facebook can now be moved to Google Photos and stored.

The company has stated that some procedures have to be followed to transfer photos from Facebook to Google.



To move a photo to Google, you need to go to Facebook's 'Your Facebook Information'.

For that you have to go to settings.

Then go to 'Transfer a copy of your photos and videos'.


After clicking there, the photo on Facebook will be stored in Google Photos, according to Facebook officials.

Passwords are also requested to start the photo transfer service.

After linking your Facebook and Google Accounts, users will be able to move their Facebook photos to Google Photos.



Privacy and security challenges for  Xiaomi Phone users


Users of the Chinese phone Xiaomi Phone face privacy and security challenges. As the phone will be used for unauthorized recording and tapping of mobile phones of the users, the privacy of the users of those phones has been questioned. Experts say that this has added to the challenge of cyber security.

Various studies have shown that this phone has more problems with Xiaomi Phone Mi 10, Saomi Ridmi K20, Saomi Mix 3 and other phones.

In a query from Forbes magazine, the company said privacy and security were its top concerns and was unaware of such issues.

The weaknesses of this phone have become public after the data of watching porn videos of various users of Xiaomi Phone came out. According to analytical studies, the incident has brought a significant smile to Saomi's users.



A study by Andrew Deerney, a cybersecurity researcher, found that Saomi sent a lot of data and browsers to Google Play.

It violated the privacy of millions of users, according to a report in Forbes Magazine. Saomi has admitted that it leaked while trying to secure the user's data.

This problem has been seen in not only one of Saomi's but also many series of phones, which has caused psychological panic among the users. At the same time, the impact of the leaked confidential data will be even more challenging in the coming days.

Researchers have concluded that this is even more dangerous as all the activities of the users of the phone will be tracked.


In collaboration with Saomi, this data can be accessed and used by anyone, according to Forbes News.

According to the news, a customer of Saomi had reached the office of Forbes with the phone set of Saomi of Woke. Forbes then proceeded with its research on the subject.

Apple is making history again, becoming the world's first 2 trillion company

Apple is making history again, becoming the world's first 2 trillion company

Apple Corps, the world's first non-governmental company with a market value of  1 trillion in 2018, is crossing another milestone.



Apple is set to become the world's first company with a market value of 2 trillion in the near future.

The recent outbreak of the corona virus has caused many companies around the world to fluctuate, and Apple has not been spared. But despite such adversity, analysts estimate that Apple's market value will cross the 2 trillion mark in the near future.

Amit Daryanani, a well-known tech analyst at Cultofmack, made this claim in his latest research report. According to him, the market value of Apple will reach 2 trillion dollars by September 2024. According to the report, by 2024, Apple's earnings per share will reach  23.

His projection is based on a model of 14 percent annual EPS growth. He said Apple's EPS for 2020 would be. 12.72 and would double in the next four years.

Analysts say Apple will not venture into new areas and continue to expand its core business in the wake of its unique success with a market value of 2 trillion. Apple's two main businesses are cloud / software, as well as wearable gadgets such as AirPods and Apple Watch.


Researcher Daryanani also noted that the service sector business will grow faster than Apple's hardware business. The report estimates that Apple will be able to earn 60 billion a year from these two sectors over the next few years.

Google Chrome's dominance in web browsers, a fierce battle between Firefox and Edge

Internet search giant Google's web browser Google Chrome has maintained its dominance in the Internet browser market. According to a recent report by Computer World, Google Chrome has a 68.26 percent share in the desktop web browser.

Other web browsers lag far behind Chrome. Firefox, the second largest browser, has a share of 77.2 percent. Microsoft's Edge is in third place with 6.67 percent of users choosing the Internet browser market.


According to the report, Chrome has not lost its first place in the browser market since 2008 and in recent times other browsers have not been able to reach its side.

Before 2008, Microsoft's Internet Explorer was the most popular web browser. Despite the strong presence of Mozilla Firefox, Internet Explorer had a market share of up to 70 percent at the time.

According to the latest report, Google Chrome is also in the first place in the smartphone browser. Chrome accounted for 64.8 percent of the mobile browser, followed by Apple's Safari browser with a market share of 26.71 percent.

Chrome's popularity among tablet users has waned. According to statistics, Chrome has only 48.59 percent market share in tablet web browsers. Apple's Safari browser accounted for 41.17 percent.

Google recently released Google 83rd Edition. In addition to various other new features, there is also a group tab feature. This feature allows users to rename, color, and move tabs in the tab bar.



Chrome has also added Safe Browsing Mode and Safety Check Tool in the new update. Google says it is constantly taking steps to protect users' data and privacy in the Chrome browser. Agency

GENETIC TESTING

Genetic testing will facilitate doctors seek for missing or defective genes.
This data helps them apprehend if someone, their partner, or their baby is probably going to possess sure medical conditions.
Genetic tests ar once tiny samples of blood or body tissues ar analyzed many various varieties of body fluids and tissues is used the sort of genetic check required to form a designation depends on that condition a doctor checks for.
For genetic testing before birth, a biopsy will screen pregnant girls for a few disorders to see for others, or if the screening biopsy finds a attainable downside, doctors might advocate prenatal diagnosis or villus sampling:
Amniocentesis may be a check typically done between weeks fifteen and twenty of a woman's physiological state.
The doctor inserts a hollow needle into the woman's abdomen to get rid of atiny low quantity of humor from round the developing foetus.


The fluid is checked for genetic issues and may show the sex of the kid.
Once there is risk of premature birth, prenatal diagnosis will show however way the baby's lungs have matured prenatal diagnosis carries a small risk of causing a miscarriage.
Chorionic villus sampling (CVS) typically is completed between weeks ten and twelve of physiological state.
The doctor removes atiny low piece of the placenta to see for genetic issues within the foetus as a result of villus sampling is associate degree invasive check, there is a tiny risk that it will induce a miscarriage.
A doctor might advocate counseling or testing for any of those reasons:
A pregnant woman's possibilities of getting a baby with a body downside (such as trisomy) increase if she is older than thirty four kids of older fathers ar in danger for brand new dominant genetic mutations — those caused by one genetic disease that hasn't run within the family before.
A standard antenatal screening check had associate degree abnormal result.
Doctors might advocate genetic checking if a screening test showed a attainable genetic downside.
A couple plans to start out a family associate degreed one among them or a detailed relative has an familial ill health.
Some individuals ar carriers of genes for genetic sicknesses, even supposing they do not show signs of the ill health themselves.
This happens as a result of some genetic sicknesses ar recessive this implies they cause symptoms given that someone inherits 2 copies of the matter factor, one from every parent kids UN agency inherit one downside factor from one parent however a standard factor from the opposite parent will not have symptoms of a recessive ill health however they'll have a five hundredth probability of passing the matter factor to their kids.
A parent already has one kid with a significant congenital abnormality.
Not all kids UN agency have birth defects have genetic issues.
Sometimes, exposure to a poison (poison), infection, or physical trauma before birth causes a congenital abnormality although a baby contains a genetic downside, it'd not are familial.
Some happen attributable to a spontaneous error within the child's cells, not the parents' cells.
A woman has had 2 or a lot of miscarriages.
Severe body issues within the foetus will generally cause a spontaneous miscarriage many miscarriages might purpose to a genetic downside.
A woman has delivered a stillborn kid with physical signs of a genetic ill health several serious genetic sicknesses cause specific and distinctive physical issues.
A child has medical issues that may be genetic once a baby has medical issues involving over one body system, genetic testing would possibly facilitate doctors realize the cause and create a designation.


A child has medical issues familiar to be a part of a genetic syndrome.
In some cases, it additionally would possibly facilitate realize the sort or severity of a genetic ill health this will facilitate doctors realize the most effective treatment.
Progress in genetic testing has improved however doctors diagnose and treat some sicknesses.
Genetic tests will determine a specific downside factor however they cannot perpetually confirm however that factor can have an effect on the one who carries it.
In mucoviscidosis, for instance, finding a haul factor on body variety seven cannot predict whether or not a baby can have serious respiratory organ issues or milder metabolic process symptoms.
Also, having downside genes is just a part of the story several sicknesses develop from a mixture of speculative genes and environmental things, a number of that someone will management somebody UN agency is aware of they carry speculative genes could be ready to create fashion changes to avoid changing into sick.
Research has known genes that place individuals in danger for cancer, cardiopathy, psychiatrical disorders, and lots of different medical issues.
The hope is to sometime develop specific varieties of factor medical aid to stop some diseases and sicknesses.
Gene medical aid is being studied as a attainable thanks to treat conditions like mucoviscidosis, cancer, and adenosine deaminase deficiency (an immune deficiency), RBC sickness, hemophilia, and Mediterranean anemia however some patients have had severe complications whereas receiving factor medical aid.
Genetic treatments for a few conditions ar an extended manner off however there's still nice hope that a lot of a lot of genetic cures are found.

Genetic testing

Genetic testing involves examining your DNA, the chemical information that carries directions for your body's functions. Genetic testing will reveal changes (mutations) in your genes which will cause malady or illness.

Although genetic testing will give vital info for designation, treating and preventing malady, there ar limitations. as an example, if you are a healthy person, a positive result from genetic testing does not perpetually mean you'll develop a illness. On the opposite hand, in some things, a negative result does not guarantee that you just will not have a particular disorder.

Talking to your doctor, a medical life scientist or a genetic counselor concerning what you'll do with the results is a crucial step within the method of genetic testing.

Genome sequencing

When genetic testing does not result in a identification however a genetic cause remains suspected, some facilities provide ordering sequencing — a method for analyzing a sample of DNA taken from your blood.

Everyone contains a distinctive ordering, created of the DNA altogether of somebody's genes. This complicated testing will facilitate determine genetic variants which will relate to your health. This testing is sometimes restricted to merely viewing the protein-encoding elements of DNA known as the exome.



Why it's done

Genetic testing plays a significant role in decisive the chance of developing bound diseases yet as screening and generally medical treatment. differing types of genetic testing ar finished totally different reasons:

Diagnostic testing. 

If you have got symptoms of a illness which will be caused by genetic changes, generally known as mutated genes, genetic testing will reveal if you have got the suspected disorder. as an example, genetic testing is also wont to ensure a identification of monogenic disease or Huntington's disease.

Presymptomatic and prognosticative testing.

If you have got a case history of a genetic condition, obtaining genetic testing before you have got symptoms could show if you are in danger of developing that condition. as an example, this kind of take a look at is also helpful for distinguishing your risk of bound varieties of large intestine cancer.
Carrier testing. If you have got a case history of a genetic defect — like red blood cell ANemia or monogenic disease — or you are in an grouping that contains a high risk of a selected genetic defect, you will opt to have genetic testing before having youngsters. AN expanded  carrier screening take a look at will observe genes related to a good form of genetic diseases and mutations and might determine if you and your partner ar carriers for an equivalent conditions.

Pharmacogenetics.

If you have got a specific health condition or illness, this kind of genetic testing could facilitate verify what medication and indefinite quantity are best and useful for you.
Prenatal testing. If you are pregnant, tests will observe some varieties of abnormalities in your baby's genes. Down's syndrome and chromosomal anomaly eighteen syndrome ar 2 genetic disorders that ar usually screened for as a part of prenatal  genetic testing. historically this is often done viewing markers in blood or by invasive testing like centesis. Newer testing known as noncellular  DNA testing appearance at a baby's DNA via a biopsy done on the mother.

Newborn screening.

This is often the foremost common style of genetic testing. within the u.  s., all states need that newborns be tested sure enough genetic and metabolic abnormalities that cause specific conditions. this kind of genetic testing is very important as a result of if results show there is a disorder like noninheritable  glandular disease, red blood cell illness or PKU (PKU), care and treatment will begin quickly.

Preimplantation testing. 

Additionally known as preimplantation genetic identification, this take a look at is also used after you decide to conceive a toddler through in vitro fertilization. The embryos ar screened for genetic abnormalities. Embryos while not abnormalities ar planted within the female internal reproductive organ in hopes of achieving physiological condition.

Generally genetic tests have very little physical risk. Blood and cheek swab tests have virtually no risk. However, prenatal  testing like centesis or villus sampling contains a little risk of physiological condition loss (miscarriage).

Genetic testing will have emotional, social and monetary risks yet. Discuss all risks and edges of genetic testing along with your doctor, a medical life scientist or a genetic counselor before you have got a genetic take a look at.



How you prepare

Before you have got genetic testing, gather the maximum amount info as you'll concerning your family's case history. Then, speak along with your doctor or a genetic counselor concerning your personal and family case history to raised perceive your risk. raise queries and discuss any issues concerning genetic testing at that meeting. Also, bring up your choices, betting on the take a look at results.

If you are being tested for a genetic defect that runs in families, you will need to contemplate discussing your call to own genetic testing along with your family. Having these conversations before take a look ating will provide you with a way of however your family would possibly reply to your test results and the way it should have an effect on them.

Not all insurance policies purchase genetic testing. So, before you have got a genetic take a look at, sit down with your insurance supplier to examine what is going to be lined.

In the u.  s., the federal Genetic info Nondiscrimination Act of 2008 (GINA) helps stop health insurers or employers from discriminating against you supported take a look at results. Under GINA, employment discrimination supported genetic risk is also outlaw. However, this act doesn't cowl life, long care or social insurance. Most states provide further protection.

What you'll expect

Depending on the kind of take a look at, a sample of your blood, skin, waters or alternative tissue are collected and sent to a research laboratory for analysis.


For newborn screening tests, a blood sample is taken by prick your baby's heel for a few tests, a swab sample from the within of your cheek is collected for genetic testing.
Amniocentesis. during this antepartum genetic check, your doctor inserts a skinny, hollow needle through your wall and into your womb to gather atiny low quantity of liquid body substance for testing check with your doctor, medical life scientist or genetic counselor before the check concerning once you will expect the results and have a discussion concerning them.
If the genetic check result's positive, which means the genetic modification that was being tested for was detected.
The steps you are taking when you receive a positive result can rely on the explanation you had genetic testing.
Diagnose a selected illness or condition, a positive result can assist you and your doctor verify the correct treatment and management arrange.
Find out if you're carrying a factor that might cause illness in your kid, and also the check is positive, your doctor, medical life scientist or a genetic counselor will assist you verify your child's risk of truly developing the illness.
The check results also can give info to think about as you and your partner build birth control selections.
Determine if you may develop an explicit illness, a positive check does not essentially mean you will get that disorder as an example, having a carcinoma factor (BRCA1 or BRCA2) means that you are at high risk of developing carcinoma at some purpose in your life, however it does not indicate with certainty that you're going to get carcinoma.
However, with some conditions, like chorea, having the altered factor will indicate that the illness can eventually develop.
Talk to your doctor concerning what a positive result means that for you.
In some cases, you'll be able to build way changes which will cut back your risk of developing a illness, although you've got a factor that creates you additional. Results may assist you build decisions associated with treatment, birth control, careers and sum.
In addition, you will opt to participate in analysis or registries associated with your genetic disease or condition.
These choices could assist you keep updated with new developments in bar or treatment.
A negative result means that a mutated factor wasn't detected by the check, which might be encouraging, however it isn't a 100% guarantee that you simply haven't got the disorder.
The accuracy of genetic tests to observe mutated genes varies, betting on the condition being tested for and whether or not or not the point mutation was antecedent known.
Even if you do not have the mutated factor, that does not essentially mean you will ne'er get the illness as an example, the bulk of individuals UN agency develop carcinoma haven't got a carcinoma factor (BRCA1 or BRCA2).

Also, genetic testing might not be ready to observe all genetic defects.
In some cases, a factortic check might not give useful info concerning the gene in question everybody has variations within the means genes seem, and infrequently these variations do not have an effect on your health however typically it are often troublesome to tell apart between a disease-causing factor and a harmless factor variation.
These changes square measure referred to as variants of unsure significance.
In these things, follow-up testing or periodic reviews of the factor over time could also be necessary.
No matter what the results of your genetic testing, speak along with your doctor, medical life scientist or genetic counselor concerning queries or considerations you will have this can assist you perceive what the results mean for you and your family.
Explore salad dressing Clinic studies testing new treatments, interventions and tests as a way to stop, detect, treat or manage this illness.

Machine learning uncovers cell identity regulator by histone code


Curating CIGs

We first performed an intensive query of PubMed using the search expression “cell identity”[Title/abstract] OR “cell marker”[Title/abstract], which returned 7581 PubMed abstracts. We then searched the abstract for names of 297 cell types listed in the SHOGoiN database and ranked cell types by number of associated abstracts. To retrieve CIGs, we then conducted a manual literature review for the ten top-ranked cell types that also have RNA-sequencing (RNA-seq) data and ChIP-Seq data for the histone modifications of H3K4me3, H3K4me1, H3K27ac, and H3K27me3. We also defined control genes by requiring that their names did not appear together with the name of the given cell type in literature or any of five major annotation database, i.e., the Entrez Gene, Gene Cards, Ensembl, Gene Ontology, and KEGG. We then further selected a random subset of the control genes, so that number of control genes in the subset is the same as number of our curated CIGs.


Data collection

The RNA-seq, H3K4me3, H3K4me1, H3K27ac, and H3K27me3 ChIP-seq data for the ten well-defined cell types (H1-hESC, CD34 + HPC, B cell, HUVECs, human mammary epithelial cells, neural cells, MRG cell, normal human lung fibroblast, mesenchymal stem cell, human skeletal muscle myoblast) and landscape analysis are downloaded from GEO database and ENCODE project website (https://www.encodeproject.org/)38.



ChIP-seq and RNA-seq data analysis

Human reference genome sequence version hg19 and UCSC Known Genes were downloaded from the UCSC Genome Browser website39. RNA-seq raw reads were mapped to the human genome version hg19 using TopHat version 2.1.1 with default parameter values. Expression value for each gene was determined by the function Cuffdiff in Cufflinks version 2.2.1 with default parameter values. Afterwards, quantile normalization of gene expression values was performed across cell and tissue types.



For ChIP-seq data, reads were first mapped to hg19 human genome by Bowtie version 1.1.0:



bowtie -p 8 -m 1 --chunkmbs 512 –best hg19_reference_genome fastq_file



Wig file is generated using DANPOS 2.2.3:



python danpos.py dpeak sample –b input --smooth_width 0 -c 25000000 --frsz 200 --extend 200 –o output_dir



Quantile normalization is performed using DANPOS 2.2.3:



python danpos.py wiq --buffer_size 50 hg19.chrom.sizes.xls wig –reference reference.qnor.sort.wiq --rformat wiq --rsorted 1



By this method, ChIP-seq data from different cell and tissue types were all normalized to have the same quantiles.



Bigwig is generated using the tool WigToBigWig with the following command line:



wigToBigWig -clip sample.bgsub.Fnor.wig hg19.sizes.xls sample.bw



The tool WigToBigWig was downloaded from the ENCODE project website (https://www.encodeproject.org/software/wigtobigwig/)38. The “hg19.sizes.xls” in the command line is a file containing the length of each chromosome in the human genome. We then submitted the bigWig file to the UCSC Genome Browser (https://genome.ucsc.edu) to visualize ChIP-seq signal at each base pair39,40.



Peak calling and feature value calculation are performed using the GridGO function in the CEFCIG framework (detailed algorithm is described below).



In feature value calculation, skewness and kurtosis values are centered on zero. If no signal is being detected in the peak calling region, skewness and kurtosis are set to zero.



GridGO algorithm

We developed GridGO, a grid-based genetic method to optimize bioinformatics parameters for detecting epigenetic signature of CIGs. We use GridGO to optimize three important parameters, including the height cutoff to define ChIP-seq enrichment peak, the upstream distance cutoff to assign a peak to a nearby gene, and the downstream distance cutoff to assign a peak to a nearby gene. However, GridGO is designed to also allow optimizing different number of parameters. For simplicity, we will describe details of the algorithm by an example in which the upstream and downstream distance cutoffs to assign a ChIP-seq peak to nearby genes are set to be the same, so that GridGO will optimize only two parameters including the height cutoff to define ChIP-Seq enrichment peak and the distance cutoff to assign a peak to a nearby gene (Supplementary Fig. 2a). In the first iteration of optimization, the entire two-dimensional parameter space will be divided into m equal-size grids. Then the parameter values in the center of each grid will be used to define ChIP-Seq enrichment peaks and to assign the peaks to nearby genes. Afterwards, P-value of difference in a peak feature (epigenetic signature) between CIGs and control genes will be determined by Wilcoxon’s test. The grid with the lowest P-value will be the optimal grid saved for the second iteration. In the second iteration, the grid saved in the first iteration will be divided into a new set of m small grids, which will be tested as the previous iteration to select an optimal grid saved for the third iteration. Such genetic evolution of parameter grid keeps going until the number of iteration become larger than a given value n or the new optimal grid is not better than the previous optimal grid. To estimate a potential overfitting effect, we used only 80% of training genes in the GridGo optimization and build the CIGdiscover model based on parameters optimized by these genes. Then the performance of CIGdiscover on these 80% genes and the rest 20% genes were compared, and little overfitting effect was observed.



Backward and forward feature selections

In backward feature elimination, all features are included in the model at the beginning. In each round of iteration, after trying to remove individual features from the model and test the influence on the model, one feature with least impairment to the performance of the model is removed. In contrast, in forward feature construction, there is no feature in the model in the beginning. In each round of iteration, after trying to add individual features from the feature pool and test the influence on the model, the feature that led to the best improvement to the model was added into the model. The performance is measured by the closest distance between ROC curve and the top left corner of the panel.



Specifically, in an iteration I of the forward feature construction process (Supplementary Fig. 3a right section), let Si−1 = [s1, s2, …, si−1] be the combination of features selected by the previous i−1 iterations, and let Ci−1 = [ci, ci+1, …, cn] be the remaining candidate features. Our algorithm will combine ci with Si−1 to form a new candidate combination and evaluate the performance of this combination by 100 times cross-validations. Similarly, the algorithm will combine ci+1, ci+2, …, or cn with Si−1 to form n − i additional candidate combinations, and evaluate the performance of each candidate combination by 100 times cross-validations. Among these n − i + 1 candidate combinations, the combination that shows the best performance will be the combination Si selected by iteration i.



Training CIGdiscover

Logistic regression model is built on the base of the Sklearn logistic regression library. Data are centralized and normalized using the Preprocessing library in Sklearn. Cross-validation is repeated 100 times by splitting the data into 80% training and 20% test data. Penalty is set as L1 to prevent overfitting in all experiments.



We can denote the response for case i as yi, the jth predictor for case i as xij, the regression coefficient and the intercept corresponding to the jth predictor as βj and μ. Let θ = (μ,β1, … ,βp)t and xi = (xi = (xi1, … ,xip)t, we estimate the parameter vector θ by maximizing the log-likelihood



$$Lleft( theta right) = mathop {sum}limits_{i = 1}^n {left[ {y_ilog left( {p_i} right) + left( {1 - y_i} right)log left( {1 - p_i} right)} right]}$$



(1)



The Lasso method is implemented by fixing an upper bound on the sum of the absolute value of the model parameters, which can be denoted by penalizing the negative log-likelihood with L1-norm. In the Logistic regression model, the negative log-likelihood is denoted by



$$- mathop{sum}limits_{i = 1}^{n} log left( p_{beta} left( {y_{i}{mathrm{|}}x_{i}} right) right) = mathop {sum}limits_{i = 1}^{n} left{-y_{i} left(mathop {sum}limits_{j = 0}^{p} beta_{j} x^{left( j right)}right) + {mathrm{log}}left(1 + {mathrm{exp}}left(mathop{sum}limits_{j = 0}^{p} beta_{j} x^{(j)}right)right)right}$$



(2)



The loss function ρ can be written as



$$rho _{left( beta right)}(x,y) = - yleft(mathop{sum}limits_{j = 0}^{p} beta_{j} x^{left( j right)}right) + log left( 1 + exp left( mathop{sum}limits_{j = 0}^{p} beta_{j} x^{left( j right)} right) right)$$



(3)



The Lasso estimator of a Logistic regression model is defined as



$$hat beta left( lambda right) = {mathrm{{argmin}}}_beta left( {n^{ - 1}mathop {sum}limits_{i = 1}^n {rho _{left( beta right)}left( {x_i,y_i} right) + lambda parallel beta parallel _1} } right)$$



(4)



For each gene, the signed distance to the hyperplane was used as CIG score. The decision threshold (CIG score cutoff) for CIGs were determined by distance to the top left corner of the ROC curve41.



P-value between a pair of ROC curves was calculated by Hanley’s method42. First, a critical ratio z will be defined as:



$$z = frac{{A_1 - A_2}}{{sqrt {{mathrm{SE}}_1^2 + {mathrm{SE}}_2^2 - 2r{mathrm{SE}}_1{mathrm{SE}}_2} }}$$



(5)



where A1 and SE1 are the observed area under curve and estimated SE of area under curve for ROC curve 1; where A2 and SE2 are the associated values for ROC curve 2. r represents the correlation between A1 and A2 via querying the table provided in Hanley’s method42. Two intermediate correlation coefficients are required to calculate r. First, rcig, is the Pearson correlation between the CIG scores given to CIGs by the two models; rnoncig, is the Pearson correlation between the CIG scores given to non-CIG genes by the two models. Furthermore, r is acquired by querying the table42 using (rcig + rnoncig)/2 and (A1 + A2)/2. SE of the ROC areas are calculated based on the following equation43.



$${mathrm{{SE}}} = sqrt {frac{{Aleft( {1 - A} right) + left( {{mathrm{na}} - 1} right)left( {Q_1 - A * A} right) + ({mathrm{nn}} - 1)(Q_2 - A * A)}}{{{mathrm{na}} * {mathrm{nn}}}}}$$



(6)



where A is the area under the curve, na and nn are the number of control genes and CIG genes, respectively, and Q1 and Q2 are estimated by: Q = A/(2 − A), Q = 2 A ∗ A/(1 + A). Then this quantity z is referred to tables of normal distributions and used to estimate p-value between the two ROC curves.



Correlation and collinearity test

Spearman correlation coefficient between features was calculated using the Python Pandas library. For collinearity analysis, variance inflation factor is calculated using the SciPy library.



Analyze the influence of cell types

For cross test, identity genes from only five randomly selected cell types were used to train the model and identity genes from the other cell types were used to test the model. Overlapped genes are removed from the training and test datasets. Similarly, to analyze how the number of cell types influence the model, identity genes that were used for training and for testing were from different cell types. For parallel test, genes that were used for training and testing were from the same cell types.

Analyze the influence of noises

To test the robustness of the model, labels of genes were swapped in four different ways: only swap identity genes to negative control genes (false negative), only swap negative control genes to identity genes (false positive), swap equal number of identity genes to control genes and control genes to identity genes (bidirectional false), and randomly change the labels of genes. After swapping, the genes were subject to CIGdiscover to test its performance.

Training CIGnet

Data related to network nodes (genes), edges, and closeness between nodes were downloaded from CellNet website (http://cellnet.hms.harvard.edu/)1. For each cell-type-specific cell identity subnetwork, only the CIGs were used. Known master transcription factors were defined as in Fig. 1. Control transcription factors are randomly selected from all known transcription factors, except the master transcription factors. Due to the small number of master transcription factors in the training datasets, SMOTE is used to expand the positive genes in training datasets following the distribution of feature values of the known master transcription factors. All performance tests for the logistic regression model are conducted in the same way as described for CIGdiscover. Cell-type specificity is calculated using Tau index44 and scaled to be between −1 and 1.

Pathway analysis

Top 500 identity genes ranked by the CIG score are selected to perform the DAVID pathway analysis (DAVID 6.8 https://david.ncifcrf.gov/)45. The pathways with q-values (adjusted P-values using the Benjamini method) smaller than 0.05 were defined as significantly enriched.

Cell lineage hierarchy analysis

For heatmap and hierarchy cluster analysis, we retrieved top 500 identity genes for each cell or tissue type to create a heatmap of cell identity score using the tool Morpheus (https://software.broadinstitute.org/morpheus) with default parameters.

Data visualization

Decision boundary analysis is performed using Sklearn and visualized by Matplotlib. ROC curves are created using Matplotlib. Bar plots is created using the Prism statistical software package (Graph Pad Software, Inc., La Jolla, CA, USA). Scatter plots are created using Microsoft Excel software.

Average density of epigenetic marks in promoter region around TSS were plotted using the Profile function in DANPOS version 2.2.3:

python danpos.py profile wig --genefile_paths putative_identity_genes.txt,putative_negative_identity_genes.txt --genefile_aliases positive,negative --heatmap 1 --name outdir --genomic_sites TSS --flank_up 3000 --flank_dn 10000

Heatmap for density of epigenetic marks around TSS is plotted using the software MeV46 version 4.8.1.

We have added figures for visualization of CIG networks for individual cell or tissue types in the “network visualization section” of our CIGDB at https://sites.google.com/view/cigdb/predicted-db/network-visualization

Maintenance of human PSCs

Human PSCs were maintained on Matrigel in mTesR1 medium. Cells were passaged approximately every 6 days. To passage PSCs, cells were washed with Dulbecco’s modified Eagle’s medium (DMEM)/F12 medium (no serum) and incubated in 1 mg/ml dispase until colony edges started to detach from the dish. The dish was then washed three times with DMEM/ F12 medium. After the final wash, colonies were scraped off of the dish with a cell scraper and gently triturated into small clumps and passaged onto fresh Matrigel-coated plates.

Human PSCs differentiation to ECs

Differentiation is induced 4 days after PSCs passaging (day 0). Mesoderm specification is induced by the addition of bone morphogenetic protein 4, activin A, small-molecule inhibitor of glycogen synthase kinase-3β (CHIR99021), and vascular endothelial growth factor (VEGF). Mesoderm inductive factors are removed on day 3 of differentiation and are replaced with vascular specification medium supplemented with VEGF and the transforming growth factor-β pathway small-molecule inhibitor SB431542. Vascular specification medium is additionally refreshed on days 7 and 9 of differentiation. Flow cytometric analysis of differentiated ECs is performed on day 10.

CRISPR gRNA and lentiviral vector design

Two open-access software, Cas-Designer (http://www.rgenome.net/cas-designer/) and CRISPR design (http://crispr.mit.edu/), were used to design guide RNAs (gRNA) targeted to candidate gene. Two guides were designed per gene shown as in Supplementary Data 4.

Target DNA oligos were purchased from IDT (Integrated DNA Technologies) and cloned into the lentiCRISPR v2 plasmid2 (Addgene plasmid# 52961) via BsmBI restriction enzyme sites upstream of the scaffold sequence of the U6-driven gRNA cassette. All plasmids were sequenced to confirm successful ligation.

Lentiviral constructs

Lentivirus was packaged by co-transfection of constructs with second-generation packaging plasmids pMD2.G, psPAX2 into a six-well plate with HEK293T cells. After the first 24 h of transfection (250 ng of pMD2.G, 750 ng of psPAX2, 1 µg of target plasmid), the medium was changed to DMEM and the supernatants 48 and 72 h after transfection were pooled, filtered through a 0.45 µm filter, and used for infection.

Cell culture and lentiviral transduction

HUVECs were purchased from Lonza (C2517A) and human pluripotent stem cell (hPSC) was a kind gift from Dr John Cooke’s group. All cells used in this study were within 15 passages after receipt. HUVECs were cultured in 5% CO2 and maintained in vitro in Endothelial Growth Basal Medium with EGM-2 SingleQuot Kit. hPSC was cultured in 5% CO2 and maintained in vitro in mTeSR1 basal medium with mTeSR1 supplement. Those cell lines were mycoplasma negative during routine tests. HUVECs were grown to 70% confluence and infected with lentiviral vectors. The media was changed 8 h after viral transduction and incubated for 48 h before selection with 1 µg/mL puromycin for 3 days. HUVECs were collected for cell proliferation assay, nitric oxide production, and genomic DNA extraction. On the second day of PSC passaging, PSCs were infected with lentiviral vectors. The media was changed 6 h after viral transduction. Differentiation was induced 3 days after virus infection.

T7 endonuclease I assay

Genomic DNA from lentiviral transduced cells were extracted with a Quick-DNA Miniprep Kit (Zymo Research, Irvine, CA) following manufacturer’s protocol and were quantified using a Synergy 2 Multi-Mode Reader (BioTek, Winooski, VT, USA). The targeted regions were PCR-amplified with amfiSure PCR Master Mix (GenDEPOT, Barker, TX, USA) using primers flanking the target sites. Primers sequence are shown in Supplementary Data 4.

We denatured 200 ng of the PCR products and then slowly hybridized to form heteroduplexes using the following program settings: 95 °C for 5 min, 95°–85 °C at −2 °C/s, 85°–25 °C at −0.1 °C/s. Heteroduplexes were digested with T7 endonuclease I (New England Biolabs, Ipswich, MA, USA) at 37 °C for 30 min. In addition, the digested products were separated on a 2% TAE agarose gel for analysis. Images were captured using the ChemiDoc XRS+ Molecular Imager system (Bio-Rad, Hercules, CA, USA).

Cell proliferation assay

After viral transduction and puromycin selection, HUVECs were plated in 96-well plates at a density of 1000 cells per well and allowed to attach for 24 h. Viability was measured utilizing the CellTiter-Glo® Luminescent Cell Viability Assay (Promega, Madison, WI, USA). Results were read at 24, 48, and 72 h on the on Synergy 2 Multi-Mode Reader.

Nitric oxide production assay

HUVECs (8 × 103) were added to triplicate wells of a 96-well plate with 200 ml media. 4-amino-5methylamino-2979-difluorofluorescein diacetate (10 mM) in anhydrous dimethylsulfoxide was added to each well and the plate was incubated for 30 min at 37 °C and 5% CO2. The cells were washed with PBS, 200 ml fresh media was added, and the plate was incubated for an additional 30 min. Fluorescence was measured using a Synergy 2 Multi-Mode Reader.

Endothelial cell tube formation assay

Ninety-six-well plates were coated with 50 μl Matrigel (R&D Systems, catalog number 3432-005-01) and incubated at 37 °C for 30 min. Control (1 × 104) and CRISPR/Cas9-edited HUVECs in 100 μl EGM medium were seeded in each well, respectively. After 4 h, images were captured using the Leica epi-fluorescence microscope. Branches number and length were quantified using ImageJ software.

Fluorometric LDL uptake assay

Control and CRISPR/Cas9-edited HUVECs were seeded on a 48-well plate. Alexa Fluor 594 AcLDL (Thermo Fisher Scientific, catalog number L35353) was added to the culture medium for the final 4 h of the incubation time. HUVECs were washed, trypsinized, and centrifuged at 200 × g for 5 min, then resuspended in FACSB-10. Fluorescence was determined using a flow cytometer (LSR II, Becton-Dickinson, San Jose, CA, USA) and the data were analyzed using FlowJo software.

Flow cytometric analysis

Ten days after differentiation, PSCs were trypsinized, centrifuged at 200 × g for 5 min, resuspended in FACSB-10 (FACS buffer–10% fetal bovine serum) and incubated with anti-human-VE-cadherin (Invitrogen, catalog number 17-0319-41, 1:150) and anti-human CD31 (Invitrogen, catalog number 53-1449-41, 1:150) for 30 min on ice. Fluorescence was determined using a flow cytometer (LSR II, Becton-Dickinson, San Jose, CA, USA) and the data were analyzed using FlowJo software.

Statistical analysis

For bar plots and box plots, P-values are calculated with Wilcoxon’s test (two-tail). For ROC curves, P-values are calculated by Hanley’s method42 (two-tail). Q-values (Benjamini) for pathways were directly determined using DAVID (https://david.ncifcrf.gov/). For CRISPR9 experiments, data were presented as mean ± SD of six individual experiments. Statistical analysis was performed with Student’s T-test by means of the Prism statistical software package (Graph Pad Software, Inc., La Jolla, CA, USA).



Popular Posts