Sentences with phrase «on subsets of data»

The Berkeley researchers developed their own statistical methods so that they could use data from virtually all of the temperature stations on land — some 39,000 in all — whereas the other research groups relied on subsets of data from several thousand sites to build their records.
In any case, based on this subset of data, there is evidence that, as the influence of female investors in partnerships expands, there is likely to be a benefit for female founders.
Christakis study drew on a subset of those data, a social network that includes about 5000 individuals and more than 7000 of their parents, siblings, spouses, and friends.
The process of making our fit based on data is called training, and it's standard practice to train one's model on a subset of data and then test one's model against a different subset.

Not exact matches

Some of the kinds of transactions that Bitcoin can support include so-called M of N transactions, which require agreement between a certain subset of a group, and can be used for escrow, mediation, or shared financial management; time-locked transactions, in which bitcoins are distributed on a strict schedule, useful for trusts or wills; and even data-conditional transactions, in which a script uses a data input such as a regular Google search to monitor real-world events that would automatically trigger disbursements or other actions.
The analysis of the Task Force is based on 1992 tax data and focuses on the subset of the population that has: made C/QPP contributions that year; relies on earnings from employment and self-employment as its major source of income; is between ages 25 and 65; and has annual income between $20,000 and $80,000.
The project is detailed in the contract as a seven-step process, with Kogan's company, GSR, generating an initial seed sample (though it does not specify how large this is here) using «online panels»; analyzing this seed training data using its own «psychometric inventories» to try to determine personality categories; the next step is Kogan's personality quiz app being deployed on Facebook to gather the full dataset from respondents and also to scrape a subset of data from their Facebook friends (here it notes: «upon consent of the respondent, the GS Technology scrapes and retains the respondent's Facebook profile and a quantity of data on that respondent's Facebook friends»); step 4 involves the psychometric data from the seed sample, plus the Facebook profile data and friend data all being run through proprietary modeling algorithms, which the contract specifies are based on using Facebook likes to predict personality scores, with the stated aim of predicting the «psychological, dispositional and/or attitudinal facets of each Facebook record»; this then generates a series of scores per Facebook profile; step 6 is to match these psychometrically scored profiles with voter record data held by SCL, with the goal of matching (and thus scoring) at least 2M voter records for targeting voters across the 11 states; the final step is for matched records to be returned to SCL, which would then be in a position to craft messages to voters based on their modeled psychometric scores.
The data was acquired and processed by Cambridge University professor Aleksandr Kogan whose personality quiz app, running on Facebook's platform in 2014, was able to harvest personal data on tens of millions of users (a subset of which Kogan turned into psychological profiles for CA to use for targeting political messaging at US voters).
The data comes from approximately 100,000 Intuit QuickBooks Online customers, a subset of the total QuickBooks Online user base, and is published monthly on a close to real-time basis.
It contains all the Sustainability data displayed on our website and a subset of the case studies.
No additional data were extracted from the additional report of Tracy 2013 which reports on the number of midwives and health professionals seen by a subset of publicly funded pregnant women.
In this chart showing a subset of the data on how long primates sleep, humans stand out as snoozing the fewest hours daily, on average.
Any results that are reported to constitute a blinded, independent validation of a statistical model (or mathematical classifier or predictor) must be accompanied by a detailed explanation that includes: 1) specification of the exact «locked down» form of the model, including all data processing steps, algorithm for calculating the model output, and any cutpoints that might be applied to the model output for final classification, 2) date on which the model or predictor was fully locked down in exactly the form described, 3) name of the individual(s) who maintained the blinded data and oversaw the evaluation (e.g., honest broker), 4) statement of assurance that no modifications, additions, or exclusions were made to the validation data set from the point at which the model was locked down and that neither the validation data nor any subset of it had ever been used to assess or refine the model being tested.
But a subset of the data might be made available on a Web site operated by US Strategic Command, and researchers might be able to get further data on request, says Gambrell.
They remove a small subset of the data, and if their model can accurately replicate the missing pieces, the scientists may be on the right track.
The project uses data and software to simulate a small subset of a rat's brain, focusing on a collection of neurons known as a cortical column.
«Can we find consistent results that are not dependent on a particular data set or a particular subset of subjects?»
The team has therefore proposed a new taxonomy for EECs based on their expression profile data, with many new subsets of hormone-producing cells.
This chart shows a subset of data on how long primates sleep.
We therefore performed two independent phylogenetic analyses based on larger subsets of the nuclear data, whole-exome sequences and all polymorphic sites, to reconstruct the most likely relationships between ancient and modern horses.
Combined with our data on ILC3s, it appears that B. anthracis modulates many subsets of innate lymphocytes, potentially through conserved mechanisms.
However, comparison of the global, annual mean time series of near-surface temperature (approximately 0 to 5 m depth) from this analysis and the corresponding SST series based on a subset of the International Comprehensive Ocean-Atmosphere Data Set (ICOADS) database (approximately 134 million SST observations; Smith and Reynolds, 2003 and additional data) shows a high correlation (r = 0.96) for the period 1955 to 2005.
The NEPC report presents information comparing the finances of a subset of K12 - operated schools with other schools in the same states, but it is hard to interpret the spending data without good information on the performance of K12 schools.
With passage of the Local Control Funding Formula, California became the first state to require schools to consider how best to serve a small subset of at-risk students: youth in foster care. According to 2016 California Department of Education data, in English language arts, 56.2 percent of foster students did not meet standards on the Smarter Balanced tests (compared to 30.5 percent of non-foster students) and for mathematics, 64 percent of foster students did not meet standards (compared to 37.3 percent of non-foster students).
On any given day, based on the data, teachers may gather an entire grade or a subset of students, sometimes in groups as small as one or two.
We focused on a subset of PTAs in Montgomery and Anne Arundel counties for which we had complete data over three recent FYs — 2012, 2013, and 2014.
The legislation insists on «every» student achieving and that schools use subsets of data to make certain that no child is overlooked.
CCSA's Accountability Framework is made up of two parts: an initial review of publicly available test score and postsecondary readiness data and then, for the subset of schools underperforming on all initial criteria, a Multiple Measures Review based on public and non-public data that is tailored to a school's mission and outcomes.
In addition, a limited subset of data from DIN SAR is available as a PDF file on the FAA.Gov public website.
At best, Nielsen is capturing data on a far smaller subset of both markets than it claims.
The TD Ameritrade data supports this conclusion on the subset of professionals that are RIAs, and charging on a percentage of assets model.
The best studies tend to test the robustness of their conclusions by dropping various subsets of data or by excluding whole classes of data (such as tree-rings) in order to see what difference they make, so you won't generally find that too much rides on any one proxy record (despite what you might read elsewhere).
She says she's deeply troubled by the tribal nature of some subsets of the climate science community and what she sees as ill-advised stonewalling on releasing data and interpretations of data for review and independent analysis.
Those uncertainties are based on issues of interpolation, homogenization (for non-climatic changes in location/measurements) etc. and have been evaluated multiple ways, including totally independent homogenization schemes, non-overlapping data subsets etc.
That could add up: an earlier LA Times investigation (h/t DeSmogBlog), based on data from Friends of the Earth, counted up at least 12 billion tonnes of carbon dioxide from just a subset of the projects the two funded in 1993-2006.
Because of differences among the start, end and total length of the station records they examined, the authors decided to perform their analyses on three subsets of the data.
However, when a validation was performed on a similar analysis for which the regression model was calibrated with a subset of the data, and the remaining data were used for validation, it became apparent that models based on the factors that McKitrick & Michaels used had no skill (i.e. were not able to reproduce the independent data).
After I manually transcribed and analysed relevant data from a subset of the first batch of over 4,000 scanned A8 forms received on 28 October, I wrote to Minister Freydenberg on 12 November explaining that the values recorded manually on the A8 forms from the mercury thermometers for the period November 1996 to December 2000 at Mildura are significantly different from the official values recorded from the electronic probes.
NH temperature reconstructions based on all records, and on subsets excluding, or consisting exclusively of, tree-ring data.
Can't find a recent item on arctic sea ice but hoped it might be worth pointing out Peng et al, Sensitivity Analysis of Arctic Sea Ice Extent Trends and Statistical Projections Using Satellite Data, http://www.mdpi.com/2072-4292/10/2/230/htm: «The most persistently probable curve-fit model from all the methods examined appears to be Gompertz, even if it is not the best of the subset for all analyzed periods.»
Regardless of your position on the reliability of our adjustments it is clear that combining these two independent subsets of the data in time-varying proportions without adjustment, as happens in the ICOADS data base, will lead to variability in the global SST series that is not climatic in origin.
PCA was performed as the first step (after areal adjustment) on the gridded instrumental data, 1902-1995, and the individual proxy series from 1902-1980 were calibrated against the corresponding EOFs of the instrumental data matrix by singular value decomposition to determine retention of reconstructed PCs for each proxy series and then tested for robustness against the 1854-1902 validation period as well as a smaller subset of instrumental/historical EOFs going back to the 16th century.
This post will help you understand that basing an RCS chronology on a fit to a subset of the data is fundamentally contrary to what RCS is trying to achieve.
Machine learning is a subset of AI — a powerful application of AI technology in which we expose machines to lots of data and provide them with ways to learn on their own and become incrementally «smarter» over time.
Where we disagree is on whether these should include all or only a subset of the data.
Here, for example, is a linear regression I ran on a subset of the VA court data.
New test data on a particular subset of Takata airbag inflators show a far higher risk of ruptures during airbag deployment, according to a recent article published by Brett Emison, partner at Langdon & Emison.
Christine A. Amalfe, President of the NAWL Foundation and Director at Gibbons P.C. in Newark, NJ, described the survey as «the only national study of the nation's 200 largest law firms, which annually tracks the progress of women lawyers at all levels of private practice, including the most senior positions, and collects data on firms as a whole rather than from a subset of individual lawyers.»